A task of terawatts


The world has an abundance of renewable energy to offer; the question is how to harness it.


The ultimate answer to humanity’s energy problems rises every
morning and sets every evening. The Sun shines down on
Earth with a power of 174,000 terawatts. It lifts water from
the surface of the sea to fall back down on the high mountains of
the land; drives ocean currents from the tropics to the poles; spins
winds around the world; and powers the green engine of life,
producing practically every calorie ever consumed on Earth.

Unfortunately, humanity is now, like Hamlet, too much in the Sun.
Solar energy, reduced to waste heat, no longer leaves the planet as
easily as it once did. Instead it tarries in the atmosphere, thanks to the
ever stronger greenhouse effect. This change in the planet’s energy
balance is a small one, but so great is the flow of energy from the
Sun that, over decades and centuries, it will do terrible damage.

Clearly, society must slow and eventually halt the rate at which the
greenhouse effect is being strengthened. The best way to achieve this,
as this journal has argued on many occasions, is to increase the cost of
emitting carbon dioxide and other greenhouse gases, through a
cap-and-trade scheme, or possibly a direct carbon tax. But for this strategy
to work there must be attractive alternative sources of energy.

The tendency among governments and traditional utilities to see
renewable energy sources as oddities or add-ons is thus deeply
misplaced. These sources are the future; the only question is how fast we
can harness their potential. On page 816 of this issue, we survey the
carbon-free technologies currently on offer. For the most part they
are driven by the Sun, either directly or via the indirect means of
wind, water and plants, and it is striking to see how much they
have to offer. Many of these technologies offer more than a terawatt
of additional global capacity, and all have ample room for growth.
Today’s largest, hydropower and nuclear power, could grow by a
factor of two or more, albeit with potentially troublesome
non-climate consequences. But the greatest proportional capacity for
growth lies with the current small-fry contributors. Wind turbines,
for example, offer much greater possibilities for improvement than
hydroelectric plants; the technologies for harnessing the wind far
out to sea (or, for that matter, in the high currents of the jet stream)
are still comparatively immature. And even wind does not have
the ultimate potential promised by the direct use of sunlight. In the
future, new manufacturing methods that use alternative materials
should make the capture of sunlight at least an order of magnitude
cheaper than it is now.


Ideas and incentives 

The challenge is how to scale these new technologies up for a global
market. A significant part of the answer is investment in focused
research and development. Sadly, in most of the world, government
spending on research in the energy sector has been stagnant or worse
for decades. That has to change. Also crucial is an entrepreneurial
culture that can get new ideas to mass markets. That requires both a social
network geared to innovation and a supply of ready capital. The news
here is good: in areas such as California’s Silicon Valley there is intense
interest in solar energy, for example, and the industry is well positioned
to exploit the same entrepreneurial vigour that has made computers
and communications successful over the past few decades.

Still, enthusiastic venture capitalists will not be enough. Firms with
new energy technologies need to be able to access the capital of the
stock markets if they are to grow large enough to change the world.
One way to make such firms attractive to investors is to guarantee
them sales. This can be done through regulation (insisting on a certain
amount of renewable energy in all new developments, say), through
subsidy, or through direct intervention; there is nothing to stop
governments from acting as bulk buyers of new capacity. Governments would
be wise, though, not to indulge protectionist tendencies in the process.
For investors in Texas to know that they can sell in Beijing, and for a
Bangalorean to be sure of a market in Brazil, is a great benefit.


Careful investment 

Finding economically efficient ways to stimulate the growth of a 
particular industry is not easy. The types of subsidy that have worked 
to date — such as the feed-in inducements that have built Germany its 
solar-cell industry — are expensive even at the megawatt scale. At the 
scale of tens and then hundreds of gigawatts they are likely to be
unsustainable. Perhaps worse than the cost, though, is the risk that poorly
designed subsidies will damage markets and thwart the development 
they seek to encourage. However, this is a challenge to encourage
technologies in a smart and flexible way, not a reason to leave the markets
alone. The challenge is not how to save the world most efficiently 
— just how to save the world. Getting to the right place the wrong way 
is better than going nowhere, even if waste is involved. 

There is much else to do. For example, one solution to the
problem of energy storage may be found in the development of plug-in
hybrid cars, which can take electricity out of a smart grid but also 
put it back (see page 810). 

Imaginative, energetic, organized action may require new
institutions. The next US president could send a strong signal of his
country’s seriousness by creating a well-funded Department of
Renewable Energy. The development and uptake of new
technologies around the world might also be encouraged by an International
Renewable Energy Agency, charged with research, transmitting best 
practice, removing bottlenecks, and so on. But new institutions are 
not enough. There are interest groups that will naturally oppose this 
transition, and they will need to be dealt with by robust and resolute 
political leadership. 

Even to a technoenthusiast, the scale of what needs to be done is 
daunting. But the wind keeps blowing, the grass keeps growing, and 
the Sun keeps shining. The energy is there.
Iceberg smash-up
J. Geophys. Res. doi:10.1029/2008JF001005 (2008)


When icebergs collide, they create seismic tremors 
that are detectable thousands of kilometres
away and might one day be used to track the
disintegration of ice shelves. 

Douglas MacAyeal of the University of Chicago in 
Illinois and his colleagues placed four seismometers 
on a 50-kilometre-long iceberg (marked with an
asterisk) that sits aground in the Ross Sea, off 
Antarctica. Over several weeks, they recorded many 
series of tremors, which they attribute to an adrift 
neighbouring iceberg (pictured centre) grating 
against the stationary one. Each series contained 
thousands of ice-quakes per hour. 

The tremors share some similarities — but are 
distinct enough not to be confused — with recently 
discovered tremors that emanate from subduction 
zones and from the San Andreas fault. 


D. MACAYEAL/MODIS/NASA 


COMPARATIVE BIOLOGY
Animal models


Proc. R. Soc. B doi:10.1098/rspb.2008.0506 (2008)
The hierarchy of stages involved as blood
stem cells develop into the various blood-cell
types does not differ significantly across all
species of mammal. But the rate at which
these stem cells multiply does; it is faster the
smaller a typical adult’s mass.
This finding from David Dingli of the
Mayo Clinic in Rochester, Minnesota, and
his colleagues confirms that mammals used
in experiments make accurate models of the
human blood-cell-production process.
Dingli and his co-workers reached their
conclusions after building a model of many
aspects of blood-cell production across many
species of mammal. They compared this with
the limited experimental data available.


PLANETARY SCIENCE
Soft metals


Proc. Natl Acad. Sci. USA doi:10.1073/pnas.0804609105 (2008)
Metallic helium isn’t as hard to make
as scientists thought, according to Lars
Stixrude of University College London
and Raymond Jeanloz of the University
of California, Berkeley. They calculate
that the amount of squeezing needed to
make the element conduct electricity
is smaller when helium is hot than
when it is cold; squeezing at close to
30 million atmospheres turns helium
metallic at a ‘mere’ 20,000 kelvin.

This means that metallic helium
should mix quite readily with metallic
hydrogen in the fluid interiors of Jupiter
and Saturn, where these conditions
are probably reached. That squelches the idea
that a layer of liquid, and therefore electrically
insulating, helium, floating immiscibly on
metallic hydrogen, could produce helium rain
in these planets’ dense atmospheres.


CLIMATOLOGY
Winter sun


Geophys. Res. Lett. doi:10.1029/2008GL034160 (2008)
The weekly weather cycles detected in some
big cities have been linked to higher levels
of vehicle exhaust and factory emissions on
weekdays than at weekends. But researchers
have now found that this effect acts over a
much larger area than urban centres.

Arturo Sanchez-Lorenzo from the
University of Barcelona and his colleagues
examined 44 years of climate data from 13
weather stations across Spain. They discovered
that winter weekends tend to be drier and
sunnier than weekdays. The result held for
both urban and rural areas, suggesting that the
hebdomadal cycles are caused by pollution’s
effect on regional atmospheric circulation
rather than dirty particles nucleating
raindrops close to the source of smog.

The authors observed the same
relationship across Western Europe, and the
reverse (wetter weekends) in the eastern
North Atlantic.


ASTROPHYSICS 


Slow-motion supernovae 


Astrophys. J. 682, 724-736 (2008) 
A survey of exploding stars shows that the 
farther away they are, the slower they seem 
to blow apart, as predicted on the basis of 
general relativity. The supernovae studied 
are known as type Ia supernovae, and are
important for gauging the strength of dark 
energy — a mysterious force that seems to be 
pushing the Universe apart. These findings 
reassure astronomers that the far away 
supernovae behave like those nearby. 
Stéphane Blondin at the Harvard- 
Smithsonian Center for Astrophysics 
in Cambridge, Massachusetts, and his 
colleagues studied thirteen supernovae 
between 3.6 and 7.5 billion light years 
from Earth. The group compared 
several spectra from the more distant 
supernovae with those of nearer 
ones and found that the more distant 
explosions took longer to unfold. 


ZOOLOGY 
Predatory Lotharios 


J. Fish Biol. 73, 728-731 (2008) 

When mothers warn their daughters 
that boys are after only one thing, a 
meal probably isn’t what they have in 
mind. However, male fish of one species 
have been observed enticing females of 
a closely related species to spawn so that they 
can consume their eggs. 

Suzanne Gray of Simon Fraser University 
in Burnaby, Canada, and her colleagues 
report four instances in which a male 
Telmatherina sarasinorum was seen courting 
a female T. antoniae. The females were seen 
quivering, which is normally indicative 
of egg release. When each female finished 
quivering, the male immediately turned 
around and tried to eat her eggs. 

Gray terms this behaviour ‘sneaky egg- 
eating’ and suggests it may have arisen in 
part because of a paucity of rich nutritional 
sources in Lake Matano, the Indonesian 
waterway where the fish live. 


HUMAN HISTORY 
Digging up data 


Proc. Natl Acad. Sci. USA 105, 10693-10698 (2008) 
Agriculture began in a region of the Middle
East known as the Fertile Crescent. 
That it diffused across Europe 
as human populations did 
is broadly accepted. 
But how it moved into 
southern Africa has 
been little studied. 

Brenna Henn 
of Stanford University, 
California, and her 
colleagues examined the 
frequency of two alternative 
versions of part of the 
Y-chromosome in 13 
African populations. They 
also compared 10 short, repeating DNA 
sequences known as ‘microsatellites’, which
vary in number across populations. 

The analysis shows that early farmers 
in what is now Tanzania bred with local 
populations in south-central Africa. This 
suggests that pastoralism and some types of 
cultivation were spread across the continent 
by people who practised them, rather than 
by word of mouth — and debunks the idea 
that farming reached southern Africa from 
the west. 


IMMUNOLOGY 


Successful delivery 


Cell doi: 10.1016/j.cell.2008.06.034 (2008) 
Researchers have for the first time reversed 
symptoms of HIV infection in a living animal 
using the technique of RNA interference. They 
constructed an antibody that targets T cells
(in which HIV lurks) and linked it to a
peptide carrying small RNA molecules, called 
siRNAs. The peptide helps these siRNAs enter 
T cells, where they silence certain host and
virus genes crucial to the virus’s replication.
Premlata Shankar, now at the Texas Tech 
University Health Sciences Center in El Paso, 
Sang-Kyung Lee of Hanyang University in 
Seoul, South Korea, and their colleagues 
injected the construct into mice genetically 
engineered to be easy to infect with HIV. The 
construct protected the mice from infection. It 
also restored the suppressed immune systems 
of mice that bore HIV-infected immune cells. 


GENETICS 


Appendages of note 


Science 321, 836-838 (2008) 
In about 4,000 species of flowering plant, 
individuals can grow both male and 
hermaphrodite flowers. This curious trait is 
called andromonoecy, and it has evolved on 
many separate occasions. Now, Abdelhafid 
Bendahmane of the National Institute for 
Agronomic Research in Evry, France, and his 
colleagues reveal the genetic 
mechanism behind this 
strategy in the melon 
Cucumis melo (pictured 
left). 

They found that
andromonoecy results from
a single base change in a gene
that encodes an enzyme that
is involved in producing a
hormone called ethylene.
This hormone inhibits the
development of a plant’s male
sex organs. Andromonoecious 
melons have two copies of the altered gene, 
and consequently produce less enzyme. 
This means that any female flowers they 
would otherwise have grown are turned into 
hermaphroditic blooms. 


MATERIALS SCIENCE 


Light beat 


Soft Matter doi:10.1039/b805434g (2008) 

A polymer developed at Wright-Patterson 
Air Force Base in Ohio can move up and 
down as fast as a hummingbird’s flapping 
wings when it is hit with laser light. 

Timothy Bunning and his collaborators 
prepared a network of liquid crystal polymer 
containing azobenzene molecules, which 
act as linking groups. The bonds in these 
azo groups change reversibly from one 
geometrical form to another when exposed to 
ultraviolet laser light of a certain wavelength. 
This makes the polymer bend like a 
cantilever. As the laser’s power is cranked 
up the cantilever moves faster. If polarizing 
laser light is shone on the cantilever, it moves 
through a different angle. 
RESEARCH HIGHLIGHTS


JOURNAL CLUB 


Francis Albarede 
Ecole Normale Supérieure, Lyon, 
France 


A geochemist wonders about the 
Solar System's true age. 


Scientists have long looked at the 
constituent elements of meteorites 
to find out how old the Sun and
its planets are. The most perfect
example of the oldest meteorites — 
those that formed at the same time 
as the planets — broke up and fell 
as a large shower near Pueblito de 
Allende in Mexico in 1969. This was 
named the Allende chondrite, and 
was recently the subject of a study 
by Jim Connelly of the University of 
Texas at Austin and his colleagues. 

Meteorites are often dated by 
measuring how much aluminium-26 
they contain. This isotope decays 
at a rate that allows researchers 
to tell when one meteorite is older 
than another, but too fast to work 
out these rocks’ absolute ages. For 
the relative ages to be accurate, 
however, aluminium-26 must have 
been spread evenly among the 
protoplanetary debris from which 
meteorites were born. If it was not, 
this isotope would reflect where 
they formed as well as when they 
formed, and meteorite chronologies 
would be higgledy piggledy. 

But true ages can be calculated 
from lead isotopes. Until recently, 
lead had not been measured in both 
of two common parts of the oldest
meteorites (chondrules and
calcium-aluminium-rich inclusions)
for any one rock, and there was
no way of telling whether different 
rocks formed in the same bit of the 
nascent Solar System. But the lead 
isotopes in both chondrules and
calcium-aluminium-rich inclusions 
can be counted in Allende. 

Connelly and his team have 
confirmed an age difference 
between the chondrules and 
calcium-aluminium-rich inclusions 
that had been inferred from 
aluminium-26 measurements. This 
means that the relative meteorite 
chronologies are correct, and that 
aluminium-26 was indeed evenly 
distributed when the Sun ignited 
(J. N. Connelly et al. Astrophys. J. 
675, L121-L124; 2008). If this is 
right, then the Solar System must 
be 4,567.5 million years old. 


Discuss this paper at
http://blogs.nature.com/nature/journalclub
Top climate-impacts programme shut


The lay-off last week of a senior political
scientist involved in helping poor countries
prepare for climate change has exposed a stark
division in opinions on the core purpose of a
key US climate-research institution.

The National Center for Atmospheric
Research (NCAR) in Boulder, Colorado, says
its hand was forced by several years of largely
stagnant budgets. These have resulted in the
loss of 12% of its core workforce during the
past five years, ironically a period in
which climate change moved to centre stage in
Washington DC.

But the lay-off of Mickey Glantz, a high-profile
researcher who has chalked up some
34 years at the institution, has raised questions
about whether NCAR is turning its back on
the social sciences at a time when international
efforts are focusing on mitigation and adaptation.
Certainly Glantz believes this is the case,
saying budgets are just an excuse and that the
leadership is defensively “circling the wagons”.
His dismissal is tied to NCAR’s announcement
last week that it is shutting its Center for Capacity
Building, the highly respected outreach programme
that Glantz has run since 2005.

Mickey Glantz’s social science
department has been axed.

But others associated with the programme
say they believe the NCAR leadership still
backs them despite its budgetary problems.
“I don’t think this has anything to do with
shutting down social science at NCAR,” says
Linda Mearns, who recently stepped down as
director of the Institute for the Study of Society
and Environment there. Mearns says social
scientists within the institute will continue to
work with physical scientists at NCAR on integrated
research projects. “And that’s the proper
role for an institute in social science at NCAR.”

On the other side, some scientists are questioning
whether the institution has done
enough to maintain, let alone build, its expertise
in the physical sciences, particularly in
climate modelling. These questions have been
driven home by the departure
of key scientists, including
William Collins, who helped
oversee NCAR’s climate modelling
programme. “I can tell you
in the science divisions here,
it’s the worst mood people have
seen in a long time, and one
reflection is people walking
away,” says Caspar Ammann,
a palaeoclimatologist at the
institution.

NCAR’s new director, Eric
Barron, who took over in July,
says the institution is in an
“interesting position”, caught
between a dismal budgetary
outlook and ongoing concerns about where
NCAR should direct its limited resources.
“A number of people are saying that our climate
modelling programme has taken too big of a
hit. People are saying very loudly that NCAR
is not setting its priorities the way it should,” he
says. “The simple fact of the matter is that years
of tight budgets are coming home to roost.”
Barron says he supports the social science
mission but was able to preserve several
positions throughout the institution “that
are of critical importance” by eliminating a
single programme that he says cost upwards of
$730,000 annually.

NCAR’s base budget (almost $88.5 million
in the fiscal year 2008) comes from the US
National Science Foundation, although the
institution receives significant funding from
other federal agencies as well. In the fiscal year
2007, its overall budget came to $149.3 million.
Although current appropriations
bills in Congress would
increase NCAR’s budget, few
expect this legislation to pass
in an election year. Congress
is likely to wind up passing
a “continuing resolution”
later this autumn that would
effectively freeze current
spending levels until at least
early next year.

Roger Pielke Jr, a climate
policy expert at the Center
for Science and Technology
Policy Research at the University
of Colorado at Boulder,
says it’s not clear why Glantz
was singled out or, more broadly, how NCAR
is addressing its fiscal situation. “There’s really
no transparency in how these decisions are
made,” he says.

Glantz, who has been guaranteed one year’s
salary, says he plans to stay on for a while,
although such courtesies will not be extended
to his staff, including an administrative position
and two researchers.
Jeff Tollefson


Physicists await dark-matter confirmation


Rumours are swirling that a European satellite
mission may have detected dark matter, the
mysterious particles thought to make up as
much as 85% of all matter in the Universe.
Nature has learned that the PAMELA (Payload
for Antimatter Matter Exploration and
Light-nuclei Astrophysics) mission, a collaboration
between Italy, Russia, Germany and
Sweden, has detected a surplus of high-energy
antielectrons whizzing through space. The
antielectrons, also called positrons, could be the
clearest signature yet of the dark matter lurking
in the Milky Way, according to Dan Hooper, a
theoretical physicist at Fermilab in Batavia,
Illinois. “If it’s true, it’s a major discovery,” he says.
Previous space and balloon missions saw
hints of the same positron surplus in the
1990s. But their energy range was limited and
their measurements had high uncertainty.
PAMELA is, in principle, capable of detecting
higher-energy positrons with far better
accuracy than any other mission to date.

PAMELA’s principal investigator, Piergiorgio
Picozza, a physicist at the University of Rome Tor
Vergata, declined to comment on the findings
until they have been published. However, those
who have seen the PAMELA data think they
are telling. “This is the type of signal that one
would expect [from dark matter],” says Graciela
Gelmini, a theoretical physicist at the University
of California, Los Angeles, who has studied the
team’s results. But, Gelmini adds, the measurement
is tricky, “so one has to be cautious”.

Most physicists believe that the Universe
is filled with particles that rarely, if ever,
interact with the stuff of stars, planets and
people. This ‘dark matter’ has so far only been
indirectly detected through its gravitational
pull on galaxies, and its influence on the
curvature of the Universe itself.

But PAMELA may be seeing another, more
concrete signature of the ghostly particles. The
satellite has found an unexpected increase in
the number of high-energy positrons flitting
through the Galaxy, says Bob McElrath, a
theorist at CERN, Europe’s high-energy physics
laboratory near Geneva in Switzerland. The
findings were briefly flashed across the screen
at this month’s International Conference on
High Energy Physics in Philadelphia,
Pennsylvania, and they’ve got the community abuzz,
McElrath says. “I’ve had people e-mailing and
asking me to try and get the slides.”

PAMELA’s positron surplus could come
from dark matter particles described by the
supersymmetry theory. This predicts the
existence of super-heavy counterparts to everyday
particles. The lightest supersymmetrical
particles are expected to be both massive and
stable, making them prime candidates for dark
matter. Occasionally, theorists believe, two of
these particles will smash together and annihilate
each other in a burst of energy. The annihilation
will create a stream of more conventional
particles that will eventually decay, leaving energetic
electrons and positrons. In other words, the
positrons detected by PAMELA could be the direct
result of dark-matter annihilations.

The detection is the second dark-matter
claim by Italians in recent months. In April,
scientists at the DAMA/LIBRA (Dark Matter/
Large Sodium Iodide Bulk for Rare Processes)
experiment, located beneath Italy’s Gran Sasso
mountain, claimed to have seen dark-matter
particles (see Nature 452, 918; 2008). The
PAMELA results are unconnected, and their
suggested mass range for dark matter seems to
contradict the DAMA claim.

PAMELA’s findings are far from certain.
Identifying fast-moving positrons is extremely
difficult, says Stefan Schael, a physicist at
RWTH Aachen University in Germany. Their
positive charge and high energies can make
them look like ordinary protons, he says.
Ideally, experiments would have two detectors
capable of telling the difference, but PAMELA
only has one. “They have only one chance for
this identification,” Schael says. “This is the
main challenge for the group.”

Even if the surplus exists, it could be from
nearby astronomical sources. Objects such
as neutron stars, pulsars and X-ray binary
stars are capable of making energetic
positrons that would fool PAMELA. “When the
data are released there will certainly be a large
number of people looking at the question,” says
Hooper.

For now, however, the community is waiting
with bated breath. As McElrath says: “We all
wonder what’s going on up there.”
Geoff Brumfiel
Biodiversity body
‘lacks science’ 


Swedish researchers have launched a scathing 
attack on the scientific credentials of an 
international advisory body on biodiversity, 
warning that its effectiveness is being 
undermined by the increasing dominance of 
politicians and professional negotiators. 

Their concerns about the work of the 
scientific body that advises the Convention on 
Biological Diversity (CBD) are widely shared, 
the convention's own executive secretary, 
Ahmed Djoghlaf, has told Nature. The 
convention has been signed by 168 countries 
who pledge to significantly reduce the current 
rate of biodiversity loss by 2010. Article 25 
of the convention states that government 
representatives shall be “competent in the 
relevant field of expertise”, but according to the
Swedes, this is often not the case. 

In a letter published in Conservation Biology,
the ten scientists in the Swedish delegation to 
the CBD say that some parties to the convention 
are clearly trying to move away from science so 
that the convention does not interfere with trade 
and economic growth (L. Laikre et al. Conserv. 
Biol. 22, 814-815; 2008). 

Per Wramner of Södertörn University College
in Flemingsberg, who is one of the letter's 
authors, says that the February CBD meeting in 
Rome pushed them to act after it became bogged 
down in political wrangling and semantics. “This 
last meeting was a disaster from the scientific 
perspective,” says Wramner, who chairs the 
Swedish government's CBD advisory group. 

“Mexico and the European Union also expressed 
concern that there are too many new issues of 
procedure and of a policy nature,” says Djoghlaf. 

Conservation scientist Michael Stocking of 
the University of East Anglia, Norwich, UK, says 
that the nomination system is “the core of the 
problem, in that these tend to be government 
nominees...not scientists who are up to date 
with the literature”. Countries that fund the CBD
will have to insist on change for it to actually 
happen, says Stocking, who is vice-chair of the 
Scientific and Technical Advisory Panel for the 
Global Environment Facility, which administers 
the funding for the CBD. 

The concerns come amid attempts led by 
France to create a new international science 
policy group on biodiversity. Modelled on 
the same independent framework as the 
Intergovernmental Panel on Climate Change, this 
new body could mitigate some of the recently 
raised concerns. A ‘concept note’ for the new 
group was circulated last month by France.
Daniel Cressey 
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it's the worst mood people have 
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reflection is people walking 
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says the institution is in an 
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positions throughout the institution “that 
are of critical importance” by eliminating a 
single programme that he says cost upwards of 
$730,000 annually. 

NCAR’ base budget — almost $88.5 million 
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Although current appropria- 
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expect this legislation to pass 
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is likely to wind up passing 
a “continuing resolution” 
later this autumn that would 
effectively freeze current 
spending levels until at least 
early next year. 

Roger Pielke Jr, a climate 
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for Science and Technology 
Policy Research at the Univer- 
sity of Colorado at Boulder, 
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modelling programme has taken too big of a 
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is not setting its priorities the way it should,’ he 
says. “The simple fact of the matter is that years 
of tight budgets are coming home to roost.” 
Barron says he supports the social science 
mission but was able to preserve several 


says it’s not clear why Glantz 
was singled out or, more broadly, how NCAR 
is addressing its fiscal situation. “There’s really 
no transparency in how these decisions are 
made,’ he says. 

Glantz, who has been guaranteed one year’s 
salary, says he plans to stay on for a while, 
although such courtesies will not be extended 
to his staff, including an administrative posi- 
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particles. The lightest supersymmetrical 
particles are expected to be both massive and 
stable — making them prime candidates for dark 
matter. Occasionally, theorists believe, two of 
these particles will smash together and annihilate 
each other in a burst of energy. The annihilation
will create a stream of more conventional parti- 
cles that will eventually decay, leaving energetic 
electrons and positrons. In other words, the posi- 
trons detected by PAMELA could be the direct 
result of dark-matter annihilations. 

The detection is the second dark-matter 
claim by Italians in recent months. In April, 
scientists at the DAMA/LIBRA (Dark Matter/ 
Large Sodium Iodide Bulk for Rare Processes) 
experiment, located beneath Italy’s Gran Sasso 
mountain, claimed to have seen dark-matter 
particles (see Nature 452, 918; 2008). The 
PAMELA results are unconnected, and their 
suggested mass range for dark matter seems to 
contradict the DAMA claim. 

PAMELA’s findings are far from certain.


Identifying fast-moving positrons is extremely 
difficult, says Stefan Schael, a physicist at 
RWTH Aachen University in Germany. Their 
positive charge and high energies can make 
them look like ordinary protons, he says. Ide- 
ally, experiments would have two detectors 
capable of telling the difference, but PAMELA 
only has one. “They have only one chance for 
this identification,” Schael says. “This is the 
main challenge for the group.” 

Even if the surplus exists, it could be from 
nearby astronomical sources. Objects such 
as neutron stars, pulsars and X-ray binary 
stars are capable of making energetic posi- 
trons that would fool PAMELA. “When the 
data are released there will certainly be a large 
number of people looking at the question,” says
Hooper. 

For now, however, the community is waiting 
with bated breath. As McElrath says: “We all 
wonder what's going on up there.”

Swedish researchers have launched a scathing
attack on the scientific credentials of an 
international advisory body on biodiversity, 
warning that its effectiveness is being 
undermined by the increasing dominance of 
politicians and professional negotiators. 

Their concerns about the work of the 
scientific body that advises the Convention on 
Biological Diversity (CBD) are widely shared, 
the convention's own executive secretary, 
Ahmed Djoghlaf, has told Nature. The 
convention has been signed by 168 countries 
who pledge to significantly reduce the current 
rate of biodiversity loss by 2010. Article 25 
of the convention states that government 
representatives shall be “competent in the 
relevant field of expertise”, but according to the
Swedes, this is often not the case. 
The ten scientists in the Swedish delegation to
the CBD say that some parties to the convention 
are clearly trying to move away from science so 
that the convention does not interfere with trade 
and economic growth (L. Laikre et al. Conserv. 
Biol. 22, 814-815; 2008). 

Per Wramner of Södertörn University College
in Flemingsberg, who is one of the letter's 
authors, says that the February CBD meeting in 
Rome pushed them to act after it became bogged 
down in political wrangling and semantics. “This 
last meeting was a disaster from the scientific 
perspective,” says Wramner, who chairs the 
Swedish government's CBD advisory group. 

“Mexico and the European Union also expressed 
concern that there are too many new issues of 
procedure and of a policy nature,” says Djoghlaf. 

Conservation scientist Michael Stocking of 
the University of East Anglia, Norwich, UK, says 
that the nomination system is “the core of the 
problem, in that these tend to be government 
nominees...not scientists who are up to date 
with the literature”. Countries that fund the CBD
will have to insist on change for it to actually 
happen, says Stocking, who is vice-chair of the 
Scientific and Technical Advisory Panel for the 
Global Environment Facility, which administers 
the funding for the CBD. 

The concerns come amid attempts led by 
France to create a new international science 
the same independent framework as the
Intergovernmental Panel on Climate Change, this 
new body could mitigate some of the recently 
raised concerns. A ‘concept note’ for the new 
group was circulated last month by France.
Daniel Cressey 




FOUR WHEELS GOOD? 


With the world’s love of cars showing little sign of abating, manufacturers are under 
increasing pressure to make vehicles less polluting and oil dependent. Duncan 
Graham-Rowe explores some of the technologies that could keep us on the road. 


HYBRIDS

One way to reduce the energy cars
consume is to recycle it: hybrid
vehicles use the combustion engine
to charge up a battery that provides
extra power to the engine when
accelerating, thus reducing fuel
consumption. ‘Regenerative braking’,
in which the electric motor provides
resistance to the drive train to help
slow the vehicle, converts the kinetic
energy into electricity, which is then
stored for later use. The benefits of
hybrids can really be seen during town
driving — the fuel savings may be only
a few per cent on longer trips. Fleets
of hybrids could also provide a place
to store electricity on a large scale,
helping in the use of various renewable
technologies (see page 805).

[Figure: hybrid drivetrain. Labels: fuel tank, battery, electric motor.
Key: electric motor in use throughout; petrol engine used; battery power
used; battery recharging. Phases shown: starting, normal driving,
acceleration, deceleration, stopping and starting.]


ENGENIUS

One goal is to make existing combustion engines
more efficient. Last year, the gauntlet was thrown
down with the X-Prize Foundation’s Automotive
X Prize, a US$10-million bounty for whoever can
develop a car that can do 100 miles per gallon.

Automotive engineering company Ricardo
UK, based in Sussex, has made some headway,
designing an engine that can switch between
two-stroke and four-stroke modes. The company claims
this could reduce fuel consumption by 27% and
emissions by a similar amount. Four-stroke engines
carry out the four stages of air intake, compression,
combustion and exhaust in four strokes of a piston,
whereas two-stroke engines take just two strokes of
the piston. Four-stroke engines are more fuel-efficient
at constant high speeds, whereas two-stroke engines are
more efficient when accelerating or pulling a heavy
load. Software in Ricardo’s concept engine, 2/4
SIGHT, controls hydroelectric valves that switch
between four-stroke mode during cruising and
two-stroke mode when accelerating or hill climbing.


LASER INJECTION

Replacing spark plugs with laser-pulse
ignition systems may sound over the top,
but it makes sense, says Andrew Scarisbrick,
supervisor for UK government and university
collaborations at Ford Motor Company's
Dunton Research and Engineering Centre
near Basildon. Laser ignition systems can
reduce fuel consumption and emissions
because they give better control of where
in the cylinder the spark occurs. Ideally, the
spark should be as far from the cylinder
walls as possible, so that the flame front is
less able to form harmful nitrogen oxides. But
this technology is still a good few years off,
Scarisbrick says. “At the moment the cost is
astronomic.”

[Figure: engine cylinder. Labels: injector, intake valve, exhaust valve,
exhaust manifold, plug, piston.]
BETTER BATTERIES

One reason electric vehicles have never
taken off is the lack of decent battery
technologies. Lithium-ion batteries are
currently the best candidates for cars
because they deliver power much more
quickly than others. But they only carry
enough charge to last for around 100
miles before needing recharging. They
also have to be replaced every couple of
years. And in extreme circumstances they
can catch fire or explode. So, many car
manufacturers, including Toyota with the
Prius hybrid vehicle, have opted for nickel
metal hydride batteries, which pack less
punch than the lithium-ion ones but last
longer and are safer.

Companies such as Massachusetts-based
A123Systems are finding new ways
to squeeze more life out of a battery.
The cathodes within batteries degrade
with repeated use and recharging. So
A123Systems has developed a birdcage-like
nanostructure of lithium iron phosphate
surrounding the cathode that prevents it
from expanding and contracting during
charging and discharging, and so stops
the cathode wearing down so quickly.
According to the company, its batteries
have a potential lifetime ten times that of
conventional ones. And although they are
not as powerful as the regular lithium-ion
batteries, they are a vast improvement on
the nickel metal hydride type.


BETTER BY DESIGN

The electric two-seater renewable
energy vehicle (TREV), developed
at the University of South Australia
in Adelaide, was designed to carry
only two people because, according
to its creators, 90% of urban trips
require no more than two seats.
Thinking along similar lines, the
Californian company Aptera recently
launched a two-seater electric car
(pictured) that it claims has the most
energy-efficient and lowest-drag shape
that can surround two people sitting
side-by-side. Even its wing mirrors are
replaced with rear-facing cameras and
internal monitors. Use of composite
materials makes the three-wheeler very
light, just 680 kilograms, placing less
of a drain on its plug-in electric motor.
According to the company, it can
go from 0 to 60 miles per hour
in ten seconds and has a top
speed of 85 mph.


CONTROL FREAK

Even subtle alterations to the way cars handle
on roads can improve fuel efficiency.
Drive-by-wire technology would remove the mechanical
linkages between the controls and the brakes,
throttle and steering mechanisms.
Computers continually interrogate
sensors about the car's handling and
the road conditions to calculate, for
example, the optimal throttle position
at that moment. Most countries
currently require a mechanical linkage
between the steering wheel
and the wheels themselves, so
manufacturers would have to persuade
governments that the technology is safe.


HYDROGEN FUEL

Widely tipped to replace oil as a
transportation fuel, hydrogen has
nearly three times the energy density
of petrol by mass, and when used to
power fuel cells it produces only steam
as a waste product. Most large car
manufacturers are developing some
form of hydrogen-powered car.

But there are major obstacles to
overcome before hydrogen could
become a mainstream fuel — how
to produce, transport and store
it. “Hydrogen requires a huge
infrastructure change that's not
likely to happen for a long time,” says
David Sims-Williams, an engineer at
Durham University, UK. Producing it
using renewable sources of energy
is extremely inefficient, and the vast
majority of hydrogen is currently
produced from coal or natural gas,
which is less than ideal.

Another major problem is that, even
in liquid form, hydrogen has a tenth the
density of water. So hydrogen has less
than a third of the energy density of petrol
by volume. As things stand, a car's fuel
tank would not only have to be cooled
to below −250 °C to keep the hydrogen
liquid but would also have to be many
times larger than existing ones.

In 2003, the US Department of Energy
issued a challenge to scientists to
develop new materials that can store
enough hydrogen to make up 6% of
the material's total mass. Frantisek
Svec at Lawrence Berkeley National
Laboratory in California has developed
a nanoporous polymer that adsorbs
hydrogen atoms reversibly on its
surface, allowing much higher densities
to be stored. But the hydrogen is only
1.5% of the polymer’s mass, still way
short of the target.

[Figure: hydrogen fuel cell. Labels: “… channelled through a plate to
the anode”; “The anode causes hydrogen to split into cations (H+) and
electrons (e−)”; “Oxidant”; “Only the cations pass through the membrane.
The electrons travel to the cathode creating a circuit”; “At the cathode,
hydrogen is oxidized, forming water”.]
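
The hydrogen figures quoted in the box above can be checked with some rough arithmetic. The sketch below is illustrative only: the heating values and densities are typical textbook numbers assumed here, not figures from the article, and the 5-kilogram hydrogen load is a hypothetical tank size chosen to show what the 6% target and the 1.5% polymer would mean for total storage mass.

```python
# Rough check of the hydrogen-versus-petrol comparison.
# ASSUMPTION: every constant below is a typical textbook value chosen for
# illustration; none of them is taken from the article.

H2_ENERGY_MJ_PER_KG = 120.0          # assumed lower heating value of hydrogen
PETROL_ENERGY_MJ_PER_KG = 44.0       # assumed typical value for petrol
LIQUID_H2_DENSITY_KG_PER_M3 = 71.0   # assumed, liquid hydrogen near boiling point
PETROL_DENSITY_KG_PER_M3 = 750.0     # assumed typical value for petrol


def energy_ratio_by_mass():
    """Energy per kilogram of hydrogen relative to petrol."""
    return H2_ENERGY_MJ_PER_KG / PETROL_ENERGY_MJ_PER_KG


def energy_ratio_by_volume():
    """Energy per cubic metre of liquid hydrogen relative to petrol."""
    hydrogen = H2_ENERGY_MJ_PER_KG * LIQUID_H2_DENSITY_KG_PER_M3
    petrol = PETROL_ENERGY_MJ_PER_KG * PETROL_DENSITY_KG_PER_M3
    return hydrogen / petrol


def storage_system_mass_kg(hydrogen_kg, hydrogen_mass_fraction):
    """Total mass (material plus hydrogen) needed to hold hydrogen_kg when
    hydrogen makes up hydrogen_mass_fraction of that total."""
    if not 0 < hydrogen_mass_fraction <= 1:
        raise ValueError("mass fraction must be in (0, 1]")
    return hydrogen_kg / hydrogen_mass_fraction


if __name__ == "__main__":
    print(f"by mass:   {energy_ratio_by_mass():.2f} x petrol")
    print(f"by volume: {energy_ratio_by_volume():.2f} x petrol")
    # Hypothetical 5 kg hydrogen load, at the 6% target and at 1.5%.
    for fraction in (0.06, 0.015):
        total = storage_system_mass_kg(5.0, fraction)
        print(f"{fraction:.1%} hydrogen by mass -> {total:.0f} kg in total")
```

Under these assumed values the ratios fall where the text puts them: a little under three by mass and well under a third by volume. A material holding 1.5% hydrogen by mass would need to be four times heavier than one meeting the 6% target for the same hydrogen load.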
The AIDS fight: looking ahead to 2010


As this month's International AIDS Conference in Mexico City ends, Erika Check Hayden asks leaders of 
the HIV community what they expect to see on the agenda when the conference reconvenes in Vienna in 
2010 — the year by which the world has agreed to provide universal access to HIV treatment. 


Julio Montaner 
President, International 
AIDS Society, 2008-10 
We want to make 2010 
an accountability con- 
ference, so that we have 
a clear idea of which 
promises have been 
made, kept and broken 
in the roll-out of antiretroviral therapy. And 
there will be more focus on eastern Europe, 
where the epidemic is a catastrophe brewing. 
The issue of intravenous drug users is going to 
become dominant. Also, the distance between 
what we know works and what we're actually 
doing needs to be closed, or the epidemic will 
become totally out of control. 


Michel Kazatchkine 
Executive director, Global 
Fund to Fight AIDS, 
Tuberculosis and Malaria 
I expect more progress 
from the science of treat- 
ment, with more powerful 
combination treatments 
and ways of selecting 
combinations of antiretrovirals to provide 
long-term suppression of the virus and avoid 
resistance. I expect a lot more information on 
how antiretroviral therapy can help with pre- 
vention. And I’m looking for what percentage 
of countries will have reached universal coverage 
in treatment; I'm quite hopeful for a number of 
middle-income countries in western Africa and 
Latin America. 


Peter Piot 

Executive director, 

Joint United Nations 
Programme on HIV/AIDS 

I think even more atten- 
tion will be paid to the 
‘how’ of prevention. We
should have results of the 
big studies of pre-expo- 
sure prophylaxis [PrEP]. I hope we will have a 
better understanding of protective immunity 
and more information on resistance to antiret- 
rovirals. And there will be more on linking up 
countries’ health systems to AIDS activities. 
Tachi Yamada
President, Bill & Melinda 
Gates Foundation 
Global Health Program 

I hope we will have 
thought through how 
to deal with drug resistance and the need to
go to second-line treatments. In prevention, I would like to see that
we've started on the roll-out of circumcision, 
and have doubled the prevention of mother-
to-child transmission. I’d like to know whether
the addition of antiretrovirals to microbicide
treatment will make a difference. And I’d like
a plan to address multiple concurrent partners 
in high-prevalence countries. 


Zeda Rosenberg 

Chief executive, 
International Partnership 
for Microbicides 

The two ongoing trials 
of early-generation 
microbicides will have 
been completed and the 
next generation of microbicides should be
well on its way to showing
efficacy. The CAPRISA trial of tenofovir gel 
should be finished, and the data will be under 
analysis. Several trials of next-generation 
microbicides are due to be launched — both 
safety studies of newer products, and efficacy 
studies of products based on non-nucleoside 
reverse transcriptase inhibitors. 


Alan Bernstein 
Executive director, Global 
HIV Vaccine Enterprise 
The revolution in bio- 
medical research, with 
high-throughput tech- 
nology, systems biology 
and computational biol- 
ogy, needs to be applied 
to vaccines. From that we might learn how to 
achieve protection. There needs to be more 
emphasis on how we measure what's going on 
early in infection. There seems to be a brief 
window of opportunity after someone becomes 
infected when we can intervene. 
Mitchell Warren
Executive director, AIDS 
Vaccine Advocacy 
Coalition 

We should have data 
from up to four of the 
initial PrEP studies and 
from a 16,000-person 
Thai vaccine trial of a 
prime-boost combination of ALVAC-HIV and 
AIDSVAX. Ideally, we will have at least one new 
vaccine and/or microbicide product in an effi- 
cacy trial. We should be talking about impact 
from the male-circumcision programme scale- 
up as opposed to research results. And we will 
be talking about ‘antiretroviral-based prophy- 
laxis’ as opposed to microbicides or PrEP. 


Mark Dybul 

US global AIDS 
coordinator 

I hope we have more data 
on the impact of HIV 
on countries’ health sys- 
tems and on innovative 
approaches to incorporat- 
ing the private sector into 
prevention. The HIV community should be 
looking at other public-health efforts that have 
had generational behavioural change. We should 
see more full-country ownership, with all sectors 
talking about what they've done with the free- 
dom to do what they need to do. I hope we see 
more of that and less discussion of global archi- 
tecture and the things we know how to do. 


Kevin De Cock 

Director of the HIV/AIDS 
Department, World 
Health Organization 

Male circumcision needs 
to be scaled up. We're 
waiting for answers about 
the use of antiretroviral 
therapy in people already
infected with HIV, to see whether it lowers 
viral load and how that translates into reduced 
transmission. I don’t think we're going to get 
universal access in all countries by 2010, but 
there are some places, such as Namibia and 
Botswana, that can do it.
Anthrax case ignites new forensics field


The messy tragedy surrounding the 2001 US 
anthrax attacks and the suicide of chief sus- 
pect Bruce Ivins has thrown the emergent field 
of microbial forensics into the spotlight. The 
forensic techniques proved vital in allowing the 
Federal Bureau of Investigation (FBI) to make 
its case that the anthrax used in the attacks 
came from a particular sample in Ivins’s lab. 

Ivins died before facing charges, but had his 
case reached trial it would have been a major 
test for the discipline, which barely existed 
before 2001. 

[Image caption: Anthrax spores were forensically analysed.]

Microbial forensics traces the origin of a
biological agent using a range of biochemical
analyses, including genomic sequencing and
protein and carbohydrate fingerprinting. The
discipline has its roots in military and intelli-
gence communities, and was used successfully
in court as far back as 1998, in the trial of a
doctor in Louisiana who intentionally infected
his former mistress with HIV. But the field took
off during the anthrax investigation with an
influx of government funding and the advent
of cheaper, faster sequencing technology.
Microbial forensics is likely to have an increas- 
ing role beyond biocrime. “The big thing that’s 
going to be useful out of all this is its application 
to molecular epidemiology,” says Paul Keim, a
molecular geneticist at Northern Arizona Uni- 
versity in Flagstaff, who worked with the FBI 
investigation. For example, microbial forensics 
could be used to pinpoint the source of food- or 


water-borne pathogens. Referring to the recent
US Salmonella outbreak, Keim says, “If the same
thing happens in a year, I would expect to see
hundreds of Salmonella genomes sequenced.”

The field could also be important in lawsuits 
over hospital-acquired infections, says micro- 
biologist Abigail Salyers of the University of 
Illinois at Urbana-Champaign. People have 
sued hospitals for millions, claiming they con- 
tracted methicillin-resistant Staphylococcus 
aureus (MRSA) as patients. Microbial foren- 
sics could be used to trace an infection to its 
source and ascertain whether it was acquired 
at a particular hospital. 

Microbiologists will need to be confident of 
their technique — forensic analysis requires 
unusual rigour. In a journal publication, it 
is usually understood that further work is 
needed, says Keim. “In court, you don’t have 
that luxury,” he says. “You have to get it right
the first time.”
Amber Dance 
Large Hadron Collider
passes first proton test 


The world’s largest particle accelerator has 
tasted its first protons. 

On 8 August, physicists injected a 
few billion protons into a section of the 
Large Hadron Collider (LHC) at CERN, 
Europe’s high-energy physics laboratory 
near Geneva, Switzerland. The accelerator 
will eventually drive trillions of protons 
into each other at energies high enough to 
perhaps generate new kinds of particles. 

The small beam tested the synchronization 
between the LHC and a booster accelerator. 
“I must say I was very pleased,” says Lyn
Evans, LHC project leader at CERN. “It went 
extremely smoothly.” 

Further tests will be carried out in coming 
weeks, with a view to having protons 
running laps around the 27-kilometre LHC 
ring by 10 September. 


Environment agency denies 
plea to cut biofuel goals 


The US Environmental Protection Agency 
(EPA) last week upheld the national 
mandate to use biofuels. The decision was 
made despite opposition from livestock 
and environmental groups, which say that 
the expansion of corn (maize) ethanol is 
pushing up food prices and damaging the 
environment. 

EPA administrator Stephen Johnson 
denied the request of Rick Perry, the 
Republican governor of Texas, to halve the 
‘renewable fuels standard’, which requires
the nation to produce 34 billion litres of 
ethanol in 2008. Perry had argued that 
livestock producers in his state spent an 
additional US$1.2 billion on corn feed 
between 2004 and 2007, a trend that he 


linked to the expansion of biofuels. 

But Johnson says that repealing the 
standard would do little, if anything, to 
bring corn prices down. He also says that the 
EPA’s analysis found no evidence that the
mandate is damaging the national economy.
For a longer version of this story, see http://tinyurl.com/5hr3jv.


San Francisco's biotech hub 
gets lift from Pfizer deal 


San Francisco's drive to turn the Mission 
Bay area into a technology-transfer hub has 
been boosted by Pfizer’s decision to move 
one of its research centres to the site. 

In early 2010, the Biotherapeutics and 
Bioinnovation Center will move into newly 
constructed lab facilities near the Mission 
Bay research complex run by the University 
of California, San Francisco. About 100 of 
the centre’s employees will move there from 
its current base in South San Francisco. 

Five years after the university opened 
facilities at Mission Bay, biotech firms are 
finally trickling in. In June, Pfizer, which 
is headquartered in New York, agreed a 
US$9.5-million, three-year deal for research 
collaborations with Mission Bay’s California 
Institute for Quantitative Biosciences. 


Gorilla numbers doubled 
in the Congo 


Western lowland gorillas (Gorilla gorilla
gorilla) are more common than previously
thought, according to a census of the
northern regions of the Republic of the
Congo. Led by the New York-based Wildlife
Conservation Society (WCS), the count
found 125,000 of the apes in a
47,000-square-kilometre area. Previous estimates
held that there were fewer than 100,000
western lowland gorillas in the entire
world, and the International Union for
Conservation of Nature has speculated
that the Ebola virus could have reduced
numbers even further.

Census-takers counted the gorillas’
sleeping nests. The WCS attributes the
high numbers to successful management
of protected areas in the Republic of
the Congo, a food-rich habitat and the
“remoteness and inaccessibility” of the
region. The work was presented at the
International Primatological Society
Congress in Edinburgh, UK.

Even so, a separate global assessment
presented at the same conference reports
that half of primate species are threatened
with extinction in the next ten years.

For a longer version of this story, see http://tinyurl.com/68ewes


Hotspot for midges proves to be fertile ground


This false-colour infrared
image, taken in September
2006 with the ASTER
sensor on NASA's Terra
satellite, shows Myvatn,
a shallow lake in Iceland
from which emerge huge
numbers of mating midges
in summer.

Bright red indicates
vigorous vegetation; the
thin border of hot pink
around the lake has been
fertilized by the bodies
of dead midges. In late July and early August, 135 kilograms of biomass was deposited there per
hectare per day, according to Phil Townsend, a remote sensing specialist at the Department of
Forest and Wildlife Ecology at the University of Wisconsin in Madison.

Townsend, who presented his results last week at the Ecological Society of America meeting in
Milwaukee, Wisconsin, is using satellite imagery to identify more ‘midge lakes’.


FDA further tightens 
conflict of interest policies 


The US Food and Drug Administration 
(FDA) is introducing guidelines to reduce 
potential conflicts of interest among 
members of its advisory committees. 

In part, the regulations address a 2007 law 
that requires the agency to cut the number of 
advisers who have conflicts of interest by one- 
quarter over the next five years. According 
to the guidelines, committee members may 
not have financial ties to the topics under 
consideration. Those with financial interests 
totalling less than $50,000 would be eligible 
for waivers allowing them to participate. 

The guidelines also instruct members 
of committees to vote simultaneously. 
Previously, members stated their votes 
aloud in succession, which could influence 
later voters. In addition, the FDA intends 
to make information about committee 
activities more accessible by posting related 
documents on its website prior to meetings. 


Last week saw the 2008 SciFoo, the annual 
‘unconference’ organized by Nature, Tim O'Reilly 
and Google. Topics ranged from the Large 
Hadron Collider to the neuroscience of the brain- 
machine interface and a new social contract for 
science. Details and participants are at 
www.nature.com/scifoo. 
On the scent


For Marcel Proust, the madeleine evoked childhood memories — a new treatment of the science of smell 
attempts to take our everyday experience of odour to a more insightful level, explains Gary Beauchamp. 


What the Nose Knows
by Avery Gilbert
Crown Publishers: 2008. 304 pp.


In 2004, the Nobel Prize for Physiology or 
Medicine was awarded to Linda Buck and 
Richard Axel for their work on the science 
of smell. They discovered the family of more 
than a thousand olfactory receptors that rec- 
ognize the volatile compounds that make 
up our scented world. To researchers in the 
field, this honour was neither surprising nor 
unanticipated. Many outsiders wondered: 
what is the social, biological and medical sig- 
nificance of this breakthrough that makes it 
so noteworthy? 

Avery Gilbert’s engaging book What the Nose 
Knows attempts to answer this question. Focus- 
ing on our experiences of scent in everyday life, 
it proclaims the importance and curious nature 
of the human sense of smell. Until the last chap- 
ter, which speculates on the future, Gilbert has 
little to say about the biological science under- 
lying olfaction. He has even less to say about 
odours in other mammals and insects, where 
they have a greater role in regulating social and 


sexual activity than in humans. What this book 
lacks in basic biological science, it makes up 
for in psychological analysis, complemented 
throughout with literary allusions that Gilbert 
skilfully uses to illustrate his insights. 

Gilbert draws on his personal experience, 
both anecdotal and scientific. He asks how 
many odours there are and how we might 
determine them, and describes the principles 
of perfumery. He addresses the psychological 
aspects of odour phobias and multiple chemo- 
sensory disorder, and explores odour memory 
within the literature of Marcel Proust. Gilbert 
seems especially fond of stories of the smell of 
dead bodies and human gaseous emanations. 
The book is perhaps overly larded with short, 
albeit fascinating, stories and olfactory anec- 
dotes, and lighter on synthesis. Citations to the 
psychological literature abound, but they are 
listed at the end and the reader must flick back 
and forth to find a source. 

The story of odorizing the movie experience 
is delightfully told. The ‘battle of the smellies’ 
between the scientifically based Smell-O-Vision,
pushed by the impresario Mike Todd, and its 
rival AromaRama illuminates both the entre- 
preneurial spirit of the 1950s and the scientific 
and technological aspects of odour choice and
delivery. The difficult task of odour removal is
highlighted by a cinema that, in the words of one
reviewer, was filled with scent “strong enough
to give a bloodhound a headache”.

© 2008 Macmillan Publishers Limited. All rights reserved

Gilbert adopts a strong position on matters 
of controversy, sometimes trying hard to arouse 
disagreement. He persuasively dismantles the 
idea of Proust as poster boy for the profundity 
of odour memory, but I remain convinced that 
major research efforts to understand the phe- 
nomenon are still needed. When I encounter the 
scent of my grandfather's garage, I am not only 
reminded of the place — I am briefly transported
there. Gilbert, too, calls attention to this power: 
“Like a nightclub mentalist, the mind presents 
us with a memory it picked from our pocket 
when we weren’t looking.” The neural pathways
unique to olfaction are probably involved but do 
not completely explain the experience. 

Similarly, I question Gilbert’s argument that 
smell makes a greater contribution to flavour 
than does taste. First, it is unclear what metric 
would be useful in measuring this. And there 
are many reasons to believe that taste — sweet, 
sour, salty, bitter and umami or savoury — is 
more potent as far as food selection and intake 
are concerned. Smell permits one to discrimi- 
nate strawberry from cherry, but taste provides 
the information necessary to decide whether
an object should be taken into the body or 
expelled. 

Many years ago, when working with Gilbert 
at the Monell Chemical Senses Center in Phil- 
adelphia, I recognized his way with words as 


well as with scientific research. What the Nose 
Knows melds the academic and business worlds 
of smell into an entertaining and illuminating 
rumination on this almost magical sense that, 
even with a Nobel Prize to its credit, still holds 
many mysteries.


Gary Beauchamp is director of the Monell 
Chemical Senses Center, 3500 Market Street, 
Philadelphia 19104, Pennsylvania, USA. With 
Stuart Firestein and David V. Smith, he is co-editor 
of Olfaction and Taste. 

e-mail: beauchamp@monell.org 


Playing to win 


Starting With Serotonin: How a High-Rolling 
Father of Drug Discovery Repeatedly Beat 
the Odds 

by Ann G. Sjoerdsma 

Improbable Books: 2008. 640 pp. $27.50 


Science is a gamble. Publication, applying 
for grants, student admissions and corporate 
relationships all involve high-stakes bets, a 
mixture of skill and luck, and often a bit of 
bluffing. Which game is science most like? 
It’s not a slot machine, mindlessly addictive. 
In dark moments it may seem like roulette, 
with its powerful house advantage and long 
odds. Sometimes it’s a horse race, when one 
thoroughbred laboratory noses out another 
in isolating a long-sought gene or subatomic 
particle. For Albert Sjoerdsma, sometimes 
called the father of clinical pharmacology, sci- 
ence was most like craps. 

Craps, an intricate dice game that can involve 
many players and interweaving rounds of bet- 
ting, is a thinking person’s pastime. Winning 
depends on an understanding of probability 
and being able to weigh complex constella- 
tions of risks and payouts. In Starting with Sero- 
tonin, Sjoerdsma’s biographer daughter Ann 
Sjoerdsma argues that craps was her father’s 
favourite game of chance — and the key theme 
in his scientific life. 

Albert Sjoerdsma came to the table with a 
modest stack of chips. Born in 
1924 and raised near Chicago 
in Illinois, he grew up with little 
money or social sophistication, 
but with a first-class brain and a 
mountain of confidence. He was 
a rough-and-tumble child, more 
likely to be found playing sports 
or getting into mischief than curled up with 
his nose in a book. Yet his grades were nearly 
perfect, and he was accepted at the University 
of Chicago under president Robert Hutchins, 
whose innovative programmes helped to train 
some of the best minds of the late twentieth 
century. Sjoerdsma was cocky and sometimes 
disrespectful, especially when faced with arbi- 
trary displays of power. His daughter describes 
him as a maverick, a clichéd but apt term. 


“Craps was
Sjoerdsma's favourite
game of chance

— and the key theme 
in his scientific life.” 


Rather than fold with a bachelor’s degree, 
Sjoerdsma stayed in the game, taking an MD 
and a PhD at Chicago. He then went east to 
Bethesda, Maryland, where he joined the 
National Institutes of Health (NIH) in 1951. 
After two years of residency at the Public 
Health Service's Marine Hospital in Balti- 
more, Sjoerdsma landed a position at the 
National Heart Institute back 
in Bethesda. There, he formed 
a team that became known 
locally as the “wild bunch”,
a group of brilliant, hard- 
working and hard-playing 
researchers. 

Sjoerdsma began exploring 
ways of reducing high blood 
pressure, leading to his inves- 
tigation of the then recently 
discovered neurotransmit- 
ter serotonin. His analysis of 
its effects on different organ 
systems and metabolic path- 
ways led him into a strongly 
applied style of pharmacology, 
in which he largely eschewed 
lab-based studies in favour of whole patients, 
and focused on bridging the gap between labo- 
ratory science and clinical medicine. 

He was a pioneer in the development of 
monoamine oxidase inhibitors as antidepres- 
sants, such as iproniazid, originally developed 
as an anti-tuberculosis drug. With colleague 
Sidney Udenfriend, he found that monoamine 
oxidase was a major pathway 
for serotonin in both mice and 
humans, and that inhibitors 
such as iproniazid raised blood 
serotonin levels. This estab- 
lished the physiological basis 
of the antidepressant action 
of these drugs. Sjoerdsma’s 
20 years at NIH coincided with the ‘golden 
age’ of its intramural research, an era of Nobel 
prizes, headline-grabbing breakthroughs and 
major contributions to science. 

In 1972, he parlayed his successes as a bench 
scientist into a job as director of a new research
centre in Strasbourg, France, set up by the 
pharmaceutical company Richardson-Merrell. 
His blunt, incisive intellectual and administrative
style was polarizing in genteel Europe. He
won many devoted friends and colleagues,
and lost some, too. But there was no arguing
with his successes. The biggest was terfenadine 
(Seldane), the first antihistamine that did not 
cause drowsiness. Developed in the 1970s, it 
was a blockbuster drug until heart arrhythmias 
surfaced in some users. 

Sjoerdsma remained with Richardson- 
Merrell through the 1980s, 
until a merger moved the 
company to Kansas City, Kan- 
sas, and gave the management 
over to a group of business- 
men who knew everything 
about marketing and nothing 
about science. Sjoerdsma felt 
the house had changed the 
rules mid-game. Stripped of 
his title, and eventually even 
of his parking space, he felt 
as if he had lost the shirt off 
his back. 

Ann Sjoerdsma’s dual role, 
as daughter and as profes- 
sional journalist, creates both 
windows and blind spots as 
she examines her father’s life. She explains 
the science and integrates it into Sjoerdsma’s 
career choices and decisions. She draws on 
interviews she conducted with her father and 
with his friends and colleagues. The numer- 
ous quotations from Sjoerdsma himself, set in 
italics and without attribution, make it seem 
as though he is looking over his daughter's 
shoulder, adding a story or colourful detail, 
or murmuring assent. 

His assent is crucial, for her primary con- 
cern is to tell her father’s version of his story. 
In Sjoerdsma’s world, the US Food and Drug 
Administration is a stifling regulatory mon- 
ster, and it is drugs, more than patients, that 
live or die. Sometimes he comes across as cal- 
lous, whereas at other times he is a champion 
of humanitarian medicine, such as when he 
developed a therapy for African sleeping sick- 
ness. The author rarely questions such views or 
their motives. Also, she never inquires deeply 
into Sjoerdsma’s emotions. 

Particularly striking is the minor role of fam- 
ily in this life portrait. The other Sjoerdsmas 
feature from time to time, but we never get 
much sense of how Albert treated them or how


© 2008 Macmillan Publishers Limited. All rights reserved 


he integrated work and family life. Understandably,
the daughter's loyalty seems to trump the
journalist’s objectivity.

Yet perhaps only she could have played the 
theme of gambling so well, arguing that her 
father’s strategy was the same at the bench 
and at the table. He played to win, not to get 
rich. He was disciplined and used gambling 
as a test of his willpower. Sjoerdsma’s strategy 
was conservative, but he was willing to take big 
risks and to bet high stakes when the odds were 
good. As the author says, he repeatedly beat 
the odds. 


Do not read Starting With Serotonin for 
a story of the selfless thirst for knowledge. 
Rather, read it for the strategy — both experi- 
mental and mercantile — and the passion for 
competition. Sjoerdsma did it for the love of 
the game, for better and for worse.
Nathaniel Comfort is associate professor in the 
Institute of the History of Medicine at Johns 
Hopkins University, 1900 East Monument Street, 
Baltimore 21205, Maryland, USA. His most 
recent book is The Panda's Black Box: Opening Up 
the Intelligent Design Controversy. 
e-mail: comfort@jhmi.edu 


Challenging stereotypes 


Women in Science, Engineering and 
Technology: Three Decades of UK Initiatives 
by Alison Phipps 

Trentham Books: 2008. 184 pp. 

£16.99, €25.50 


The lack of women, especially senior women, 
in science departments is familiar. Less 
widely appreciated is the effort that has gone 
into addressing this under-representation. 
Women in Science, Engineering and Technol- 
ogy presents a history of around 150 initiatives 
to encourage women’s participation in science, 
engineering, construction and technology in 
the United Kingdom during the past three dec- 
ades. Alison Phipps, director of gender studies 
at the University of Sussex, UK, has compiled a 
valuable resource for activists, policy-makers 
and educational practitioners, also 
providing social and political context 
and analysis. 

To a woman working in science,
rather than one working for women in 
science, the book is much more than a 
reference manual. It puts our experi- 
ence in context — within the global 
economy and the women’s move- 
ment. Phipps makes a compelling 
case that achieving fundamental 
change depends on understanding 
this context and she urges closer 
communication between educators, 
academics and social scientists. 

Phipps describes how the shift 
from manufacturing to knowledge- 
based economies has created 
demand for skilled workers in 
science and technology. As in 
many countries, attracting more 
women into technical jobs has been 
identified as a priority for raising Britain’s 
competitive position in the global economy. 


Phipps distinguishes between economic and 
moral motivations to promote gender equal- 
ity in science, and alerts us to the dangers of 
relying on the current alignment between the 
interests of the economy and women’s rights.
Economic incentives may change with the 
political climate, leaving women in a vulner- 
able position if their skills are no longer valued. 
Also, the ‘business case’ for equality does not
concern itself with the underlying reasons for 
gender inequality, and undermines the idea of 
equality as an end in itself. 

Schoolgirl Amy's depiction of a typical
scientist (left) changed after a visit to
a laboratory (right).

Encouragement for women to pursue careers
in science begins in the classroom. Women
in Science, Engineering and
Technology compares grassroots
initiatives in schools
with corporate-led and
national government schemes. Phipps argues
that progress has been limited by targeting “the
problem with girls”, namely that they are gender-stereotyped
and fail to recognize the attraction
of a career in science. The most successful cam- 
paigns described address underlying reasons 
for girls’ career choices, such as the expecta- 
tions of parents and peer group, and pressures 
from wider society and culture. Less successful 
were initiatives that attempted to make science 
‘girl-friendly’, for example an after-school club
that taught computing skills through topics 
such as celebrity, fashion and music. As Phipps 
points out, this widely encountered approach 
reinforces the stereotypes that such campaigns 
are seeking to challenge. 

Phipps presents statistics to show that uni- 
versities are among the worst institutions in the 
United Kingdom in terms of gender equality. 
Women who choose a career in science may 
receive support from many professional wom- 
en's groups and networks. Yet participation is 
often low owing to limited funding and lack 
of time to participate. Phipps assesses that the 
success of initiatives has been limited by focus- 
ing on helping women to survive and thrive in 
existing masculine environments, rather than 
challenging the underlying work culture and 
perception of the ideal scientist as a masculine 
figure: rational, competitive, independent and 
technically skilled. 

If we fail to challenge the interaction between 
patriarchy and the professionals, Phipps argues, 
then increasing the numbers of women within 
technical professions “will not bring about the
hoped-for transformation”. In law and medicine,
despite the fact that women make up a
growing and significant proportion, there 
remains a marked pay gap, and the work- 
ing culture is largely unchanged. The 
book does not provide any quick solu- 
tions: increased flexibility in working 
hours and better maternity pay support 
a few highly motivated women, usually white 
and middle-class, to develop their careers. 
These initiatives also underline society's 
expectation that women will take on 
caring, domestic roles as well as profes- 
sional ones. 
We all know talented women who have 
dropped out of science from a feeling of “not
being good enough” to do research or to lead a
research group. Phipps’s book puts this inner 
voice to rest by showing that such feelings arise 
from one’s experience of society, culture and 
capitalism, not one’s abilities as a scientist.
Ashleigh Griffin is a Royal Society Dorothy 
Hodgkin research fellow at the Institute of 
Evolutionary Biology, University of Edinburgh, 
West Mains Road, Edinburgh EH9 3JT, UK. 
e-mail: a.griffin@ed.ac.uk


Bucky's utopian universe


In 1927, on the shore of Lake 
Michigan, an unemployed and 
possibly drunk US inventor 
contemplated suicide. Then, 
according to the legend he 
later spun, he had an out-of- 
body experience. He felt he 
was floating above the ground, 
surrounded by celestial light. 
A voice spoke to him: “You do 
not have the right to eliminate 
yourself,” it said, “You do not 
belong to you. You belong to 
Universe.”

Buckminster ‘Bucky’ Fuller 
— architect, environmen- 
talist, author and visionary 
— responded to this call by 
wholeheartedly attempting to 
save humanity through sci- 
ence and design. Fifty years 
of his imaginative and influ- 
ential work is the subject of a 
new exhibition at the Whitney 
Museum of American Art in 
New York. 

Boris Artzybasheff's 1963 portrait of Buckminster Fuller.

Whitney Museum of American Art, New York

Fuller described himself as
a “comprehensive, anticipatory
design scientist”. He is best
known for his geodesic dome.
Hundreds of them sprang up
like mushrooms in the late
1950s and 1960s, housing everything
from museums to military
bases. Fuller employed tetrahedrons,
which he considered to
be nature’s essential building
blocks, to enclose a spherical
space using less material than
almost any other structure. The Ford Motor
Company commissioned one of Fuller’s strong
and lightweight domes in 1953; six years later,
another housed the American National Exhibition
in Moscow, the backdrop for the famous
‘kitchen debate’ between US vice-president
Richard Nixon and Soviet premier Nikita
Khrushchev. In 1967, a dome formed the
centrepiece for the World’s Fair in Montreal,
Canada. Fuller also planned a dome some
three kilometres in diameter to cover mid-town
Manhattan like an enormous jellyfish, claiming
that it would pay for itself in a decade by the
money saved from not having to shovel snow.

The geodesic dome was like a thought bubble
encapsulating the utopian dream of modernity.
Yet most of Fuller’s schemes never left the drawing
board. Some of his earliest ideas, on show
in the exhibition, are wildly imaginative. He
planned a network of airport towers so high that
planes could dock above the clouds, imagined
planting skyscrapers in craters left behind by
Zeppelin-delivered bombs, and in one
drawing he attempted to ‘synthesize’ the
Brooklyn Bridge and a Ferris wheel.

Fuller refashioned these outlandish ideas
on a domestic scale. In 1928, attempting to
solve the US housing crisis, he designed a
dwelling whose entire structure hung from a
central pole like a merry-go-round, and could
be erected in a day. The buildings
looked like spaceships and
boasted all the latest appliances,
such as air-conditioning and a
bathroom unit that required
only a litre of water to give you a
ten-minute clean with a Fuller-designed
fog gun. He persuaded
an aircraft firm to build two
aluminium prototypes. But the
so-called Dymaxion house (a
neologism combining ‘dynamic’,
‘maximum’ and ‘ion’) proved
too expensive to mass produce.

Fuller invented a three-wheeled
Dymaxion vehicle
to complement his “New Era
Home”. This blimp-like car had
a top speed of 190 kilometres
an hour, a periscope instead of
a rearview mirror, and allowed
you to park front-first and then
swing into the tightest of spaces.
Only three were built; production
was halted in 1934 after one
crashed outside the entrance
to the Chicago World’s Fair, killing
the driver. The sole surviving
example is on view at the
Whitney, along with a short
film of Fuller showing off the
car’s tight turning circle.

Fuller’s most important legacy
is as a futurist who warned
of looming environmental
catastrophe unless technology
was harnessed to create ingenious
solutions. If the technology
in which he had such faith had
been able to keep pace with his thinking, perhaps
more of his unconventional ideas might
have been realized.

During Fuller’s last decade, the architect 
Norman Foster collaborated with him, and 
Foster acknowledges Fuller's influence on his 
later, environmentally conscious buildings. Off 
the coast of Dubai, Dutch architects are build- 
ing maritime structures that echo Fuller’s plans 
for a floating Triton City. Perhaps one day we'll 
live in the tethered cloud structures that he also 
dreamed up. 

“I did not set out to design a geodesic dome,”
Fuller wrote. “I set out to discover the princi- 
ples operative in Universe. For all I knew, this 
could have led to a pair of flying slippers.” 

Christopher Turner is a writer based in New York
and executive editor of Modern Painters magazine. 
e-mail: drchristopherturner@gmail.com


In Retrospect: Gödel's proof


In today’s computer age, the implications of the discovery in formal logic that Newman and Nagel 
articulated in 1958 are of even broader interest, says Andrew Hodges. 


Gödel's Proof
by Ernest Nagel and James R. Newman 
New York University Press: 1958. 118 pp. 


Fifty years ago an unusual book appeared. Its 
bald and unapologetic title, Gödel’s Proof, must
have left the casual browser wondering who
or what Gödel was. Those tempted to look
inside discovered a classic of scientific exposition
and faced quite a challenge. The writers,
Ernest Nagel and James Newman, already
distinguished figures in scientific philosophy
and education, gave an uncompromising presentation
of their unfamiliar subject matter:
mathematical logic. Kurt Gödel himself, the
great logician whose breakthrough discovery 
of 1931 was the subject of this book, was very 
much alive in 1958 Princeton, but he was no 
popularizer or media celebrity. For Nagel and 
Newman to see the potential interest to a wider 
public was both visionary and optimistic. 

I remember discovering Gödel’s Proof as a student
in 1968. I cannot have been the only one to
find it a unique text on the college library shelf,
leading to unexpected regions beyond the standard
syllabus. It was not written like a textbook;
neither was it a ‘Gödel made easy’. Although
rooted in an earlier article in Scientific American,
it used copious equations; indeed it explored the
very meaning of equations, a demand on the
reader that would make most publishers nervous.
Nagel and Newman explained difficult
ideas of logical deduction from formal axioms,
distinguishing formal proof from informal reasoning.
They showed Gödel’s crucial insight:
that the rules of logic for quoting axioms, substituting
variables and formulating deductions are
themselves mathematical operations. And they
revealed how his technical innovation exploited
this observation, using numbers to code statements
about numbers.
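
The idea of using numbers to code statements about numbers can be made concrete with a toy example. The sketch below is only an illustration under stated assumptions, not the scheme as set out in the book: the symbol table `SYMBOL_CODES` and the function names `encode` and `decode` are hypothetical choices made here. Each symbol in a formula is given a small code, and the formula becomes a single integer by raising successive primes to those codes, so that unique factorization lets the formula be recovered from the number.

```python
from typing import Dict, List

# Hypothetical symbol table: these codes are arbitrary choices for illustration.
SYMBOL_CODES: Dict[str, int] = {"0": 1, "s": 2, "+": 3, "=": 4, "(": 5, ")": 6}
CODE_SYMBOLS: Dict[int, str] = {code: sym for sym, code in SYMBOL_CODES.items()}


def first_primes(count: int) -> List[int]:
    """Return the first `count` primes by trial division."""
    primes: List[int] = []
    candidate = 2
    while len(primes) < count:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def encode(formula: str) -> int:
    """Code a formula as the product of prime_i ** code(symbol_i)."""
    number = 1
    for prime, symbol in zip(first_primes(len(formula)), formula):
        number *= prime ** SYMBOL_CODES[symbol]
    return number


def decode(number: int) -> str:
    """Recover the formula by reading off the exponent of each prime in turn."""
    symbols: List[str] = []
    candidate = 2
    while number > 1:
        exponent = 0
        while number % candidate == 0:
            number //= candidate
            exponent += 1
        # Composite candidates never divide what is left, so they are skipped.
        if exponent:
            symbols.append(CODE_SYMBOLS[exponent])
        candidate += 1
    return "".join(symbols)


if __name__ == "__main__":
    code = encode("0=0")  # 2**1 * 3**4 * 5**1
    print(code, decode(code))
```

Because the formula is now an ordinary integer, claims about the formula (for instance, which symbol it begins with) turn into arithmetical claims about that integer, which is the observation the authors credit to Gödel.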

Nagel and Newman’s detailed account
showed how Gödel was led to the astonishing
discovery of true mathematical statements that
could not possibly have a formal proof. In other
words, Gödel proved the formal incompleteness
of mathematics. They also recorded the
shock that this discovery caused to the hitherto
mainstream positivist assumptions, such as
those of Bertrand Russell, whose programme
for deriving mathematics from purely logical
axioms Gödel explicitly contradicted.

The implications of Gödel’s discovery are
if anything of even broader interest now than
in 1958. A vast industry has arisen founded
on logical algorithms, and nowadays it is better
appreciated that the business of computing
is inseparable from the logical calculus built
up in the early twentieth century. One could
even argue that the underlying concept of the
digital computer is owed to Gödel, via the British
mathematician Alan Turing. Turing’s 1936
concept of the universal machine is the basis of
the computer, and Turing
arrived at it by following
Gödel’s lead, seeing that
instructions could operate
on other instructions,
rather as Gödel’s numbers
had coded formal statements
about numbers.

In the 1950s there was a
tendency for mathematicians
to distance themselves
from practical applications,
and from computing
in particular. Since the
1970s these divisions have
become less rigid. Gödel’s
arguments are now more
fully connected with the
body of mathematics and
its classical problems of ‘how to solve it’. In 2000,
the Clay Mathematics Institute in Cambridge,
Massachusetts, announced seven prizes for a
set of Millennium Problems. One of these, concerning
computational complexity, has its root
in a remark of Gödel’s that might have seemed
abstruse in 1958, but is now of great value to
practical computing.

It therefore now seems a little odd that Nagel
and Newman paid no attention to computing.
They framed their closing reflections as if
Turing’s theory of computability was an obvious
corollary. By contrast with their detailed
explanation of Gödel’s technical arguments,
they found no difficulty in writing off, in a few
sentences, the possibility of artificial intelligence
(AI). This is now a huge and hotly contested area
of scientific philosophy. In fact, it was already
the subject of dispute in 1958. Turing himself,
in 1950, argued that Gödel’s proof was irrelevant
to the question of achieving AI. In the 1960s,
Gödel in turn made somewhat delphic remarks
objecting to Turing’s philosophy; he seems to
have considered that his proof implied that the
human mind could not be mechanized.

These arguments stimulated another famous
book with Gödel in the title. Douglas Hofstadter’s
1979 Gödel, Escher, Bach (Penguin),
although in many ways inspired by Newman
and Nagel’s work, took an approach diametrically
opposite to their clipped classicism.
Expansive and illustrative, it also came to quite
a different conclusion about AI: essentially
Turing’s. This disagreement remains unresolved;
in fact it is heightened by another protagonist,
Roger Penrose,
who supports something
like Gödel’s position but in
an entirely new way.

In a few pregnant words,
Nagel and Newman referred
to the brain as a machine
apparently more powerful,
through its capacity for
informal reasoning, than
computers. Penrose, since
the 1980s, has asked what
could possibly lend it such
power, finding an answer
in the ill-understood phenomenon
of quantum-mechanical
state reduction.
His conclusions are keenly
disputed: for instance the
leading logician Martin Davis, himself a
popularizer, has forcefully pressed Turing’s
original view in his book Engines of Logic
(W. W. Norton, 2001).

Nagel and Newman dedicated their book to
Russell, whose logical work during the opening
years of the twentieth century lay behind Gödel’s
proof. In that same period, Planck and Einstein
opened the quantum-mechanical door on reality.
A hundred years have not sufficed to resolve
the fundamental questions they revealed. Mathematical
and physical science describe a continuous
quantum universe using formal operations
on discrete symbols. Neither the quantum, nor
those symbols, nor the connection between
them, are yet fully understood.
Andrew Hodges is a Fellow of Wadham College, 
University of Oxford, Oxford OX1 3PN, UK. He is
author of Alan Turing: the Enigma. 
e-mail: andrew.hodges@wadh.ox.ac.uk 


Correction 

In Martin Kemp's article on Ferdinand Verbiest 
(Nature 454, 405; 2008), the nationality of 
Chinese bandit hero Goyo was incorrectly given 
as Japanese. Also, it is his nickname Chitasei, 
not the name Wu Yong, that means ‘wise star’.


The speed of instantly


Terence G. Rudolph 


Swiss speedtrap: two Swiss villages separated by kilometres (and Lake Geneva),
yet connected by quantum entanglement.


Pairs of quantum-mechanically entangled particles seem to know at once what is happening to each 
other. Experiments show that even if this signalling is not instantaneous, it must be really, really fast. 


One piece of Einstein’s theory of relativity that 
has taken hold in popular imagination can be 
summarized by the mantra “nothing travels 
faster than light”. What is less well known is 
that the theory of quantum mechanics, which 
deals with the behaviour of very small systems 
such as atomic and subatomic particles, violates 
the spirit (if not the letter) of this fundamental 
principle. Quantum mechanics predicts that, 
in certain circumstances, an activity performed 
on one particle can instantaneously change the 
properties of another particle, no matter how 
far apart the two particles are. On page 861 of 
this issue, Salart et al.¹ describe an experiment
to test how fast ‘instantaneous’ really is. 

One particularly strange feature of quantum 
mechanics is quantum entanglement. In an 
experiment involving this phenomenon, 
a physical property of a particle (or larger 
system) becomes instantly dependent on the 
properties that are being measured on another 
particle, regardless of how far apart the par- 
ticles are. In a letter to Max Born in 1947, 
Einstein dismissively called this effect of 
quantum entanglement a “spooky action at a 
distance”, and thought it indicated that the 
theory of quantum mechanics was incorrect. 

Einstein was not the first to express repug- 
nance at instantaneous action at a distance. 


Two hundred and fifty years earlier, Isaac 
Newton wrote²:
... that one body may act upon another at 
a distance through a vacuum, without the 
mediation of anything else, by and through 
which their action and force may be conveyed 
from one to another, is to me so great an 
absurdity that I believe no man who has in 
philosophical matters a competent faculty of 
thinking can ever fall into it. 


Newton was writing about his theory of 
gravity, and it was Einstein who showed in 
1915 that the action of gravity is not instantaneous
but is caused by a ‘mediating’ signal
(the warping of space-time), which moves with
a finite speed. Salart and colleagues’ experiment¹
tests whether the interaction between
entangled particles is also conveyed by a medi- 
ating signal, and if so, how fast this signal must 
travel. 

In addition to acting instantaneously, and 
in contrast to other physical effects whose 
magnitudes vary with distance, the effects of 
quantum entanglement are predicted to have 
the same strength no matter how far apart the 
entangled systems are. Erwin Schrödinger
was uncomfortable with this idea, and, in the
very papers in which he introduced the term
entanglement³,⁴, proposed that some unknown
process must ensure that entanglement occurs
over only microscopic distances. He was 
wrong. The distances over which entanglement 
has been shown to be maintained increase 
every year. For example, Salart and colleagues’
experiment¹, which is not designed to test this
question, itself measures the entanglement of a
pair of photons separated by 18 kilometres.

Salart et al. entangled their photon pairs 
using a source in Geneva, Switzerland, then 
passed them through fibre-optical cables of 
exactly equal length to receiving stations in the 
villages of Jussy and Satigny, which lie respec- 
tively east and west of Lake Geneva. Here, the 
photons’ entanglement was checked by an 
identical pair of interferometers. As they had 
travelled identical distances, the photons would 
have reached the interferometers simultaneously,
as nearly as modern optics and electronics
allow. Despite this, Salart et al. see consistent
entanglement of their photons, which means 
that the time taken by any hypothetical signal 
passing between them is below the detection 
limit of the equipment. 

There is one subtle feature to all of this, 
however. Any hypothetical signal has its speed 
defined in a specific ‘preferred frame of reference’,
which is not the same as the surface of
the Earth. Salart et al. were able to check their 






NEWS & VIEWS 


NATURE|Vol 454|14 August 2008 


results against all possible frames of reference 
by using Earth's rotation. The two villages in 
which they placed their detectors lie almost 
exactly east-west of each other, and the authors 
ran their tests at all hours of the day and night, 
allowing them to probe every possible orienta- 
tion of the experiment against a hypothetical 
preferred reference frame. Taking into account 
the accuracies of their experimental design and 
making some conservative assumptions — for 
example, that Earth is not moving relative to the 
preferred frame of reference at more than a thou- 
sandth the speed of light — Salart et al. conclude 
that any signal passing between the entangled 
photons is, if not instantaneous, travelling at 
least ten thousand times faster than light. 
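
To get a feel for the scale of that bound, a back-of-envelope calculation helps. The sketch below is a deliberately simplified illustration, not the authors' analysis: it works in the laboratory frame only and ignores the preferred-frame argument described above. The 18-kilometre separation and the factor of ten thousand come from the text; the function names `min_signal_speed` and `max_time_offset` are hypothetical. It shows how small a gap between the two detection times would have to be for a signal crossing 18 kilometres to need that speed.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0  # defined value of c in metres per second


def min_signal_speed(separation_m: float, max_time_offset_s: float) -> float:
    """Lowest speed (m/s) that covers the separation within the time offset."""
    return separation_m / max_time_offset_s


def max_time_offset(separation_m: float, speed_in_units_of_c: float) -> float:
    """Largest time offset (s) compatible with a signal of the given speed."""
    return separation_m / (speed_in_units_of_c * SPEED_OF_LIGHT_M_PER_S)


if __name__ == "__main__":
    separation_m = 18_000.0  # 18 kilometres, as quoted in the text
    bound_in_c = 1.0e4       # "ten thousand times faster than light"

    offset_s = max_time_offset(separation_m, bound_in_c)
    print(f"time offset implied by the bound: {offset_s * 1e9:.2f} ns")

    # Round trip: the offset reproduces the speed bound it was derived from.
    speed = min_signal_speed(separation_m, offset_s)
    print(f"speed recovered: {speed / SPEED_OF_LIGHT_M_PER_S:.0f} c")
```

Under these simplifying assumptions, light itself would need about 60 microseconds to cross the 18 kilometres, so a signal ten thousand times faster would have to make the trip in roughly 6 nanoseconds.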

The experiment of Salart et al.¹ beautifully
probes the deep tensions between foundational 


aspects of two of our most fundamental physi- 
cal theories — relativity and quantum mechan- 
ics — using quantum entanglement. From it 
we can conclude that any theory that tries to 
explain quantum entanglement by invoking a 
transmission mechanism will need to be very 
spooky — spookier, perhaps, than quantum 
mechanics itself.
Terence G. Rudolph is in the Department of 
Physics, Imperial College London, 

London SW7 2AZ, UK. 

e-mail: t.rudolph@imperial.ac.uk


BIOCHEMISTRY


Fit for an enzyme 


Shiven Kapur and Chaitan Khosla 


Certain enzymes that synthesize antibiotics play a game of pass the parcel,
handing biosynthetic intermediates from one active site to another. A study
reveals the dynamic nature of interactions between the enzyme domains.


Although crystal structures provide vivid 
insights into the architecture of enzymes, they 
reinforce a static picture of the molecules, 
providing only a snapshot of what a protein 
looks like in one stable conformation. This 
can be misleading, because enzymes in solu- 
tion are certainly not static. Some, such as the 
megasynthase family of enzymes, are more 
like molecular versions of factory assembly 
lines, with moving parts shuttling intermediates
from one place to another. Two studies¹,²
in this issue demonstrate the importance of 
dynamics in the catalytic cycle of a typical 
megasynthase that makes an antibiotic. Such 
insight could be used to produce variants of 
the enzyme that would make analogues of the 
naturally occurring antibiotic. 

The megasynthase family embraces several 
subclasses of enzyme, but of particular interest 
to this work are polyketide synthases (PKSs) and 
non-ribosomal peptide synthetases (NRPSs), 
both of which produce antibiotics — including 
penicillin and vancomycin, the powerful ‘drug
of last resort’. PKS and NRPS enzymes are large
and multifunctional, consisting of clusters of 
active sites known as modules. Starting with 
simple organic molecules, each module catalyses 
a series of reactions in which a new molecular 
building block is added to the growing chain of 
the antibiotic, and is then modified to install the 
chemical groups required in the final product. 
During this process, biosynthetic intermediates 
are tethered to carrier proteins, which shuttle 
them in sequence to designated active sites 
(known as client enzymes) in the module.

X-ray crystallography has revealed the
structures of several PKSs³ and NRPSs⁴, and
those of related enzymes known as fatty-acid
synthases⁵. These snapshot pictures provide
essential information about their likely mech- 
anisms of action. But there is mounting evi- 
dence that protein dynamics has a vital role in 
enzyme catalysis — for example, the activity of 
adenylate kinase, a small enzyme often used as 
a model for biochemical investigations, criti- 
cally depends on conformational changes that 
occur on several different timescales⁶. It seems
likely that dynamic effects are also important 
in PKS and NRPS enzymes, but this cannot be 
determined from their crystal structures. 

Previous work⁷ has revealed the existence
of conformational flexibility in the carrier- 
protein domains of NRPS enzymes when 
they interact selectively with different client 
enzymes. Expanding on these findings, Frueh 
et al.¹ (page 903) used nuclear magnetic
resonance to determine the structure of a carrier
protein in complex with a type I thioesterase 
— an enzyme that catalyses the final release of 
antibiotic molecules from PKS and NRPS pro- 
teins. The authors observed that part of the thio- 
esterase acts as a lid, flipping between open and 
closed conformations that expose or conceal the 
enzyme’s carrier-protein binding site. They went
on to show that this lid movement is necessary 
for the tether of the carrier protein to gain access 
to the active site, thereby catalysing the transfer 
of the antibiotic to the thioesterase. 

Figure 1 | Protein–protein interactions in
megasynthase enzymes. Many megasynthases
make antibiotics. Intermediate compounds in
the antibiotic synthetic pathway are transported
by carrier proteins to ‘client’ enzymes for
modification. Two models have been proposed
to explain the binding of carrier proteins to
client enzymes. a, In the static model, fixed
complementary features in the two proteins are
enough to explain carrier–client specificity. b, In
the dynamic model, specificity arises from the
selection of complementary conformations of
both the carrier and the client from ensembles
of conformations in equilibrium. Non-binding
enzymes from the same family as the client aren't
recognized by the carrier, because too few of
their molecules adopt the correct conformation
at equilibrium. Frueh et al.¹ and Koglin et al.²
provide evidence for the dynamic model of
binding in some megasynthases.

Similarly, Koglin et al.² (page 907) report
conformational sub-states of a thioesterase II
enzyme — a class of protein that repairs the 
tether of PKS or NRPS enzymes if it becomes 
‘damaged’ through attaching acetyl or related 
groups. The sub-states all shift towards a single 
conformation on interacting with an acetylated 
carrier protein. Taken together with Frueh and 
colleagues’ observations¹, these results² suggest
that at least some types of protein–protein
interaction in megasynthases are selected
from an ensemble of conformations that exist 
in dynamic equilibrium with each other. Such
a model has previously been proposed⁸,⁹ to
explain certain protein-protein interactions. 

Now that protein dynamics has been shown 
to play a role in antibiotic biosynthesis by NRPS 
(and thus probably also PKS) enzymes, the next 
step is to understand which, if any, of the many 
dynamic modes are involved in controlling the 
specificity of interactions between proteins. 
The importance of specific protein-protein 
interactions in such biosyntheses is well
established¹⁰,¹¹, but a better understanding of them is
essential if the enzymes are to be rationally re- 
engineered to synthesize new antibiotics. Thus 
far, attention has focused mainly on identifying 
static features that allow protein domains to 
dock selectively to each other, such as regions 
that have complementary shapes or elec- 
tric charges (Fig. 1a). The assumption is that 
such features will be similar for evolutionarily 
related client enzymes, but will be different for 
other enzyme families. If so, one should be able
to replace any individual client enzyme with a 
related enzyme, without perturbing the rest of 
the megasynthase. 

But if protein-protein specificity is the 
result of favourable docking between selected 
equilibrium conformations of the proteins’ 
domains (Fig. 1b), then rational design of mega- 
synthases requires knowledge of an entirely 
different set of issues. For example, how simi- 
lar are the conformational distributions of 
analogous domains in related proteins? Are 
the energy barriers for conformational switch- 
ing comparable for analogous domains across 
an entire family of enzymes? And does selec- 
tive docking always require the same pair of 
compatible conformations? 
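
One way to see why conformational distributions and energy differences matter in the dynamic model is a two-state Boltzmann estimate. The sketch below is an illustration only. It assumes that each protein has just two conformations (binding-competent and other) separated by a free-energy difference, and that docking requires both partners to be in the competent state at once. The function names and the free-energy values are hypothetical and are not taken from Frueh et al. or Koglin et al.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # gas constant in kJ/(mol*K)


def competent_fraction(delta_g_kj_per_mol, temperature_k=298.15):
    """Equilibrium fraction in the binding-competent state of a two-state system.

    delta_g is G(competent) - G(other); positive values disfavour the
    competent state.
    """
    ratio = math.exp(-delta_g_kj_per_mol / (R_KJ_PER_MOL_K * temperature_k))
    return ratio / (1.0 + ratio)


def joint_competent_fraction(delta_g_carrier, delta_g_client, temperature_k=298.15):
    """Fraction of carrier-client pairs with both partners competent.

    Assumes the two proteins populate their conformations independently.
    """
    return (competent_fraction(delta_g_carrier, temperature_k)
            * competent_fraction(delta_g_client, temperature_k))


if __name__ == "__main__":
    # Hypothetical free-energy differences in kJ/mol, chosen for illustration.
    cases = (("cognate client", 0.0), ("related non-binding enzyme", 15.0))
    for label, dg_client in cases:
        frac = joint_competent_fraction(2.0, dg_client)
        print(f"{label}: {frac:.4f}")
```

Under these assumptions, a related enzyme whose competent conformation is only modestly higher in energy is populated too rarely to be recognized, which is the situation described for non-binding enzymes in Fig. 1b.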

The principles of physical organic chemistry 
were founded on the study of small molecules, 
and have provided a strong basis for elucidating
the roles of static features in protein–protein
recognition. But these ideas are of
limited use in determining the rules that gov- 
ern protein dynamics. Time will tell whether 
these dynamics are pivotal to the behaviour 
of megasynthases. If we are ever to be able 
to modify such enzymes to prepare new anti- 
biotics, we can only hope that evolution has been 
parsimonious in writing the design rules.
Shiven Kapur and Chaitan Khosla are in the
Department of Chemistry, Stanford University,
Stanford, California 94305, USA.

EARTH SCIENCE


Solid evidence in the inner core 


Kenneth C. Creager 


The indications are that a solid ball of iron lies at Earth's centre. But only the
identification of an elusive seismic signature can confirm the long-standing
assumption that it is indeed solid.


It is widely accepted that Earth’s inner core is 
solid. Seismologists have long been trying to 
find direct evidence of this assertion — which 
could come in the form of the identification 
of a particular kind of seismic wave, called a 
shear wave, travelling through the inner core. 
Perhaps, as reported by Wookey and Helffrich 
on page 873 of this issue¹, that elusive wave
has now been teased out of the background 
seismic noise. 

Earth is made up of a solid crust and
mantle overlying outer (fluid) and inner cores that
consist primarily of iron. Large earthquakes 
send two types of seismic wave — compres- 
sional elastic and shear elastic waves — right 
through Earth. Both kinds of wave travel 
through a solid, but only compressional waves 
travel through a fluid. Thus, unambiguous 
detection of a shear wave travelling through 
the inner core would be once-and-for-all evi- 
dence that it is solid. The inner-core shear wave 
is extremely elusive because of its tortuous 
journey through Earth’s centre; only a smidgen
of energy remains after it converts between a 
compressional elastic wave in the fluid outer 
core and a shear elastic wave in the inner core 
on its transit both into and then back out of 
the inner core. 

Seismologists label waves in the mantle and 
crust as P for compression or S for shear; those 
in the outer core as K for compression; and 
those in the inner core as I for compression 
or J for shear. Wookey and Helffrich¹ analysed
PKJKP. In other words, they looked for a com- 
pressional wave emitted from an earthquake 
that travelled down through the crust, mantle 
and outer core (PK); that became a shear wave 
in the inner core (J); and that then travelled 
back up to the surface as a compressional wave 
(KP) where it could be recorded. 
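
The labelling convention above can be made concrete with a short sketch. The Python below is an illustration, not part of Wookey and Helffrich's analysis; the function names and the wording of the descriptions are assumptions. It expands a phase name into its legs using only the letters defined in the text, and it ignores any further symbols that real phase names may contain.

```python
# Letters as defined in the text: region and wave type for each leg.
LEGS = {
    "P": ("mantle and crust", "compressional"),
    "S": ("mantle and crust", "shear"),
    "K": ("outer core", "compressional"),
    "I": ("inner core", "compressional"),
    "J": ("inner core", "shear"),
}


def describe_phase(name):
    """Return a list of (region, wave type) legs for a phase name such as 'PKJKP'."""
    legs = []
    for letter in name:
        if letter not in LEGS:
            raise ValueError(f"unknown leg label: {letter!r}")
        legs.append(LEGS[letter])
    return legs


def crosses_inner_core_as_shear(name):
    """True if any leg of the phase is a shear wave in the inner core (label J)."""
    return "J" in name


if __name__ == "__main__":
    for region, kind in describe_phase("PKJKP"):
        print(f"{kind} wave in the {region}")
```

Read this way, PKJKP differs from PKIKP only in its middle leg, and that middle leg is the one that can exist only if the inner core is solid.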

The inner core was discovered about 70 years 
ago by Inge Lehmann². She used the timing of
elastic compressional waves to show that the 
speed of these waves increases abruptly at the 
boundary of the inner core. Francis Birch³
interpreted this increase as ‘freezing’ of the 
iron, resulting in a solid inner core and a fluid
outer core. Near the centre of Earth, tempera- 
ture increases slowly with depth, whereas 
pressure increases rapidly. So even though the 
inner core is hotter than the outer core, Birch 
inferred that it is solid because of the very high 
pressure. The observed increase in compres- 
sional-wave speed going into the inner core is 
compatible with that predicted for a fluid–solid
transition. Although Birch’s interpretation that
the inner core is solid is widely accepted, it is not
definitive. Hence the quest for better evidence. 

Large earthquakes cause Earth to resonate 
like a bell at a set of distinct frequencies, and 
studies of some of these modes have provided
evidence for the solidity of the inner core. 
However, this kind of analysis is complicated, 
and it is difficult to prove beyond a doubt that 
no model with a fluid inner core could explain 
the results. Which brings us back to the gold 
standard for evidence of a solid inner core: a
compelling observation of shear waves pass- 
ing through it. Indeed, there are four previous 
reports of observing a shear wave that went 
through the inner core, two at high frequency⁴,⁵
(above 0.1 Hz) and two at low frequency⁶,⁷
(below 0.1 Hz). The lower-frequency obser- 
vations are arguably more robust, but all such 
observations are challenging because these 
waves are similar in amplitude to many others 
that scatter off irregularities inside Earth. 

Wookey and Helffrich¹ analysed more than
700 high-quality seismograms from the Japa- 
nese Hi-net array of borehole seismometers 
(15-100 times as many stations as those used 
in the previous studies). Processing data from 
this array as if it were a downward-looking tele- 
scope allowed the authors to determine the 
precise timing and direction of seismic waves. 
They observed one wave from one earthquake 
that arrived at almost the exact time and direc- 
tion required for it to be an inner-core shear 
wave as predicted by a standard model of wave 
velocity known as ak135 (ref. 8). This was fol- 
lowed, 7 seconds later, by another arrival, which 
they interpreted as a second shear wave that had 
been split by elastic anisotropy in the inner core. 
In an anisotropic medium, a shear wave is split 
into two shear waves with orthogonal polariza- 
tions that travel at different wave speeds. The 
inner core is known to be anisotropic from the 
many analyses of the directional dependence 
of the speed of compressional waves. But this 
is the first reported observation of shear-wave 
splitting in the inner core. 

A surprising result of Wookey and Helffrich’s 
analysis is the relatively high frequencies
(0.2–0.4 Hz) at which the shear wave is observed.
Because the inner core is near its melting 
temperature, it is expected to be strongly 
attenuating. Attenuation is characterized 
by a quality factor, Q, which is proportional 
to the number of wave cycles required for 
energy to decay by a factor of 1/e. Most exist- 
ing models predict that attenuation would 
cause PKJKP to be vanishingly small at (and
above) 0.3 Hz. As the authors point out, their
observation may require changes in our ideas 
about the inner-core attenuation: perhaps the 
strongly attenuating part of the inner core is 
concentrated in the upper 400 km; perhaps Q 
for shear waves is much larger (meaning less 
attenuation) at 0.3 Hz than its measured value 
at lower frequencies; or perhaps attenuation 
in pure compression is comparable to that in 
pure shear. 
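
To give a feel for what the quality factor means in practice, here is a minimal sketch. It assumes the common textbook convention in which wave energy decays as exp(−2πft/Q), so that amplitude decays as exp(−πft/Q); the text states only that Q is proportional to the number of cycles needed for a 1/e energy decay. The values of Q and the travel time used below are hypothetical and are not taken from Wookey and Helffrich.

```python
import math


def cycles_to_decay(q):
    """Cycles needed for energy to fall by a factor of 1/e, assuming E ~ exp(-2*pi*n/Q)."""
    return q / (2.0 * math.pi)


def amplitude_fraction(frequency_hz, travel_time_s, q):
    """Fraction of amplitude left after travel_time_s, assuming A ~ exp(-pi*f*t/Q)."""
    return math.exp(-math.pi * frequency_hz * travel_time_s / q)


if __name__ == "__main__":
    # Hypothetical illustration values: not measurements from the study.
    travel_time_s = 600.0
    for q in (100.0, 400.0):
        for f in (0.05, 0.3):
            frac = amplitude_fraction(f, travel_time_s, q)
            print(f"Q={q:5.0f}  f={f:4.2f} Hz  amplitude fraction={frac:.2e}")
```

At a fixed Q, a higher frequency packs more cycles into the same travel time and so loses more energy, which is why a detection at 0.2–0.4 Hz is unexpected unless Q is larger than assumed or the attenuating region is thin.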

The inner core is its own world-within-a- 
world; separated from the mantle and crust 
by a fluid outer core, it can behave somewhat 
independently. It seems to be spinning slightly 
faster than the rest of the planet. In addi- 
tion, wave speeds in the inner core depend 
strongly on the direction in which the waves 
travel, implying that much of the inner core is
composed of strongly aligned crystals. This
alignment is apparent at scales from 1 km 
— over which it causes waves to scatter — to 
a hemispheric scale, in which compressional 
waves travelling parallel to the spin axis in the 
western hemisphere consistently produce some 
of the largest high-frequency compressional- 
wave travel-time anomalies on Earth. 

Much about the inner core remains 
shrouded from view — for example, it is not 
clear whether crystals are aligned as the inner 
core cools and grows, or by slow shear flow 
at a later stage. Wookey and Helffrich¹ may
well have opened a new door into the dark- 
ness. Whatever the case, the future of this type 
of analysis is bright — facilities such as 
Hi-net, and another seismic-sensing system, 
the EarthScope Transportable Array, offer new
possibilities for exploring the inner workings
of Earth’s inner core, as well as other parts of 
our planet.
Kenneth C. Creager is in the Department of Earth
and Space Sciences, University of Washington,
Seattle, Washington 98195-1310, USA.

PALAEONTOLOGY


The last giant kangaroo 


Jared Diamond 


Humans who colonized Australia did not reach Tasmania until thousands 
of years later — granting the island's giant kangaroos a brief respite before 
they joined their Australian brethren in oblivion. 


Many of the world’s biggest animals (the mega- 
fauna) became extinct in the latter part of the 
last ice age, which ended around 14,000 years 
ago. That realization triggered two long-standing
debates¹. First, were the extinctions caused
by humans, climatic or vegetational changes, 
fires, extraterrestrial impacts, or some com- 
bination of those agents? Second, if humans 
were responsible, did they cause the extinc- 
tions directly through hunting, or indirectly 
by some consequences of human settlement? 
Turney et al.² now contribute to answering
both of these questions. Writing in Proceed- 
ings of the National Academy of Sciences, they 
report improved dating of the demise of giant 
kangaroos and other giant marsupials, and of 
associated changes (or lack of them), on the 
island of Tasmania, off Australia. 

Megafaunal extinction proceeded to different 
extents and at different times on different conti- 
nents. If one defines megafauna as species with 
an adult body mass of more than 44 kilograms, 
then most megafaunal species living in the latter 
part of the last ice age have survived until today 
in Africa and Eurasia. But most or all of them 
became extinct in Australia about 46,000 years 
ago, in North America about 13,000 years ago, 
in South America about 13,000 years ago and 
in New Zealand about 700 years ago. Those four 
dates are close to the respective dates of human 
entry into each of these land masses, suggesting 
that humans were somehow responsible. 

The first humans to colonize these four 
regions were anatomically and behaviourally 
modern humans with sophisticated hunting
skills, who would have encountered animals
with no previous experience of humans. Those 
tame animals would have paid the price for 
their lack of fear of these sudden invaders — in 
much the same way, during recent centuries, 
that animals of Antarctica and the Galapagos 
initially had no fear of humans. In contrast, 
the animals of Africa and Eurasia were saved 
by their gradual acquisition of fear of humans 
over millions of years, as human hunting abil- 
ity slowly developed from modest ancestral 
skills such as those of chimpanzees. Although
the coincidences in time point the finger at
humans as the cause of megafaunal extinctions, 
a complication is that extinctions in North and 
South America coincided approximately not 
only with human arrival there, but also with a 
peak of global warming and glacial retreat at 
the end of the last ice age — as well as (in North 
America) with a sediment layer suggestive of 
an extraterrestrial impact³.

Arguments against a direct human role in the 
extinction of Australia’s megafauna stem from 
reports of vegetational changes and increased 
fires on the Australian mainland around the 
same time*’, and especially from the evidence 
for Tasmania available until now®’. Today, 
Tasmania is an island that lies 240 kilometres 
southeast of the Australian mainland. But 
the seas between Australia and Tasmania are 
so shallow that Tasmania was intermittently 
connected to Australia during glacial epi- 
sodes, when sea level dropped because much 
of the world’s water was locked up in ice. By 
the time humans colonized Australia around 


Figure 1| Two extinct 
species of giant 
kangaroo. The tall 
animal is Protemnodon 
anak, fossils of which 
were excavated by Turney 
et al.’ at the Mount 
Cripps site in Tasmania. 
This species, which 
stood about 2 metres 
tall and weighed 100- 
150 kilograms, survived 
on Tasmania until the 
arrival of humans. The 
other species depicted is 
Protemnodon tumbuna, 
which formerly lived in 
the mountains of New 
Guinea and was the only 
species of kangaroo that 
didn’t hop. (Painting by 
Peter Schouten.) 
above) 0.3 Hz. As the authors point out, their 
observation may require changes in our ideas 
about the inner-core attenuation: perhaps the 
strongly attenuating part of the inner core is 
concentrated in the upper 400 km; perhaps Q 
for shear waves is much larger (meaning less 
attenuation) at 0.3 Hz than its measured value 
at lower frequencies; or perhaps attenuation 
in pure compression is comparable to that in 
pure shear. 

The inner core is its own world-within-a- 
world; separated from the mantle and crust 
by a fluid outer core, it can behave somewhat 
independently. It seems to be spinning slightly 
faster than the rest of the planet. In addi- 
tion, wave speeds in the inner core depend 
strongly on the direction in which the waves 
travel, implying that much of the inner core is 


composed of strongly aligned crystals. This 
alignment is apparent at scales from 1 km 
— over which it causes waves to scatter — to 
a hemispheric scale, in which compressional 
waves travelling parallel to the spin axis in the 
western hemisphere consistently produce some 
of the largest high-frequency compressional- 
wave travel-time anomalies on Earth. 

Much about the inner core remains 
shrouded from view — for example, it is not 
clear whether crystals are aligned as the inner 
core cools and grows, or by slow shear flow 
at a later stage. Wookey and Helffrich’ may 
well have opened a new door into the dark- 
ness. Whatever the case, the future of this type 
of analysis is bright — facilities such as 
Hi-net, and another seismic-sensing system, 
the EarthScope Transportable Array, offer new 


possibilities for exploring the inner workings 
of Earth’s inner core, as well as other parts of 
our planet. 
50,000 years ago, Tasmania was an island, and 
it did not become reconnected to Australia and 
accessible to human settlement until 43,000 
years ago. Hence, claims that megafaunal spe- 
cies were already extinct in Tasmania before 
humans arrived, if supported, would be fatal 
to the hypothesis that humans eliminated the 
megafauna in Tasmania, and, by extension, in 
Australia. 

Turney et al.” have now compared the 
Tasmanian chronologies of megafaunal extinc- 
tion and human arrival, and lake-core evidence 
of climate and vegetational changes and of fire 
charcoal, thereby providing a record unique in 
the Australian region for correlating all of those 
indicators within a small area. The authors 
reanalysed museum specimens of fossils from 
four previously reported sites, as well as assess- 
ing finds from a newly excavated site at Mount 
Cripps in the northwest of the island. 

The overall Tasmanian record extends from 
the present back to at least 127,000 years ago, 
during the last interglacial period. It turns out 
that, as expected, there were drastic climatic 
and vegetational changes (cool temperate 
rainforest yielding to grassland, herbs and 
heath) around 110,000 years ago, at the transi- 
tion from the last interglacial to the last glacial 
period. Those changes were reversed after the 
transition from glacial back to interglacial con- 
ditions around 15,000 years ago. But the switch 
to glacial climates and vegetation 110,000 years 
ago had no effect on the megafauna’s composi- 
tion. All seven species recorded on Tasmania 
in the last interglacial, more than 127,000 years 
ago, were still present in the most extensive 
glacial fossil deposit, Scotchtown Cave, dated 
at 56,000 years ago — strong evidence against 
the cause of the extinctions being climatic and 
vegetational changes before humans arrived. 
The seven now-vanished giants consisted of 
two 500-kilogram marsupial equivalents of 
a rhinoceros and a ground sloth; three ante- 
lope-like kangaroos weighing 100-150 kilo- 
grams and standing 2 metres tall; a leopard-like 
marsupial predator; and a giant echidna (not a 
marsupial but a monotreme). 

The site at Mount Cripps yielded three speci- 
mens of the giant kangaroo, Protemnodon anak 
(Fig. 1), dated at between 42,900 and 40,900 
years ago, an age statistically identical to the age 
of the oldest archaeological horizon (excava- 
tion level) in Tasmania, as dated by carbon-14, 
that contains evidence of human occupation. 
This discovery supports two conclusions. First, 
despite Tasmania's proximity and vegetational 
similarity to southeastern Australia, mega- 
fauna survived later in Tasmania than on the 
Australian mainland, from which they had 
disappeared about 46,000 years ago. Second, 
in Tasmania the megafauna overlapped in time 
with humans, who probably arrived when the 
land bridge was reestablished 43,000 years 
ago, as indicated by undated human cultural 
artefacts lying below the level of the carbon- 
14-dated human horizon. Thus, megafauna 
that had survived in Tasmania for at least 
84,000 years (from 127,000 to 43,000 years 
ago), through big changes in vegetation and 
climate, disappeared within a few millennia of 
human arrival. 

The results of Turney et al.’ also point to 
the cause of that event. Unlike the Australian 
mainland, Tasmania shows no signs of changes 
in vegetation or in fire charcoal around the 
time of human arrival. Those changes on the 
mainland have been interpreted*” to mean that 
humans eliminated the mainland megafauna 
indirectly, by lighting fires that altered the char- 
acter of the vegetation. The absence of such 
evidence in Tasmania suggests that humans 
there instead eliminated megafauna directly 
by hunting. Even for the mainland, the role of 
fire-induced vegetational changes as a cause of 
megafaunal extinction is debated: vegetational 
changes might have been the result, rather than 
the cause, of the elimination of large browsing 
and grazing mammals. 

Obviously, our understanding of the chron- 
ology and cause of the Tasmanian megafaunal 
extinction is still far from complete. As sceptics 
will point out, Turney et al. have demonstrated 
the survival of one species of giant kangaroo in 
Tasmania until humans arrived. What about 
the other six species of megafauna? 


I, too, hope for more complete evidence. 
Proof to convince any sceptic would be the 
existence of many Tasmanian sites containing 
human remains dated at 42,800 years ago, all 
lying above deposits lacking human material 
and thus indicating human arrival at that date, 
plus remains of all seven megafaunal species 
at that same date, followed by the last remains 
of those species at seven different dates rang- 
ing from 42,700 to 41,900 years ago. That’s 
asking a lot. But Turney and colleagues’ study 
may motivate Tasmanian palaeontologists and 
archaeologists to seek such overwhelming 
evidence, or else to refute that scenario. 
Jared Diamond is in the Department of 
Geography, University of California, Los Angeles, 
California 90095-1524, USA. 
e-mail: jdiamond@geog.ucla.edu 
NANOTECHNOLOGY 


Shaping the void 


Astrid Lambrecht 


A vacuum may be devoid of matter, but its shape is still important. The 
strength of the Casimir force caused by quantum fluctuations in the space 
between surfaces is critically dependent on their nanometre-scale shape. 


Two reflecting plates facing each other in a 
vacuum attract. This startling phenomenon 
is known as the Casimir effect’, named after 
the Dutch physicist Hendrik Casimir who dis- 
covered it in 1948. It is a quantum-mechani- 
cal effect that arises because even a perfect 
vacuum is not entirely empty but filled with 
unavoidable fluctuations of the electromag- 
netic field, imposed by the Heisenberg uncer- 
tainty principle, and imbuing the vacuum 
with a non-zero energy. The reflecting plates 
provide boundaries between which only some 
electromagnetic fluctuations can exist com- 
pared with those in the unrestricted vacuum 
outside the plates. The vacuum energy between 
the plates is thus different from outside, resulting 
in the Casimir ‘force’, which scales with the 
fourth power of the inverse distance between 
the plates and is proportional to their surface 
area, making it, strictly speaking, a pressure. 
Writing in Physical Review Letters, Chan 
et al.’ find that the precise shape of the plates 
also has an effect by measuring the Casimir 
force between a gold-covered sphere and a 
nanostructured silicon surface. 
Casimir forces are relatively long-range in 
comparison to atomic forces, but only become 
measurable at sub-micrometre scales. In the 
mesoscopic regime (~100 nanometres) they 
are quite substantial and have been meas- 
ured with increasing precision during the 
past decade. In their experiment, Chan et al.” 
used a micro-oscillator, a device introduced 
for Casimir-force measurements some years 
ago® 4. It consists of a silicon surface, suspended 
in such a way that it can be set oscillating by 
electrical excitation. Two gold-coated glass 
spheres of radius ~50 µm are stacked on top 
of each other and attached to one side of the 
silicon plate. In isolation, this system under- 
goes harmonic oscillations at its resonance 
frequency, but when a surface is brought close 
to the topmost sphere, the Casimir attraction 
between them alters the oscillator’s resonance 
frequency. Chan et al. measured this change 
using different nanostructured silicon surfaces 
containing repeating patterns of square-sided 
grooves around 1 µm deep, with a periodicity 
of either 400 nm or 1 µm, and brought within 
150-500 nm of the gold sphere. The torsional 
axis of the oscillator was arranged perpendicularly 
to the grooves’ orientation in order 
to avoid motion due to a lateral Casimir force 
arising between corrugated surfaces’. The 
measured Casimir forces were smaller than 
those produced between spheres and smooth 
surfaces by a factor varying between 1.5 and 
1.8, depending on the distance and surface 
structure. 

The reduction in force can be explained 
by the simple argument that the top parts of 
the surface alone are close enough to interact 
efficiently with the sphere. However, the tops 
of the surface constitute half of the area of a 
flat surface and would be expected to produce 
half the Casimir force. This reasoning has 
been the basis of a widely used simplification 
known as the ‘proximity force approximation’. 
In order to find the Casimir force between bod- 
ies with more complicated shapes than simple 
planes, the proximity force approximation 
splits up the shaped surfaces into (infinitely) 
small planar surfaces and sums the Casimir 
forces between these surface elements as if 
they were independent of each other. The force 
between a sphere and a flat plate then becomes 
the integral of the pressure over the surface 
of the sphere. This approximation has been 
tested experimentally® and works so well that 
it has been used extensively to evaluate the 
Casimir force for numerous experiments in 
which, for practical reasons, the force has been 
measured between a sphere and a plate”*, and 
not in the original Casimir configuration of 
two parallel plates. 
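
As a rough numerical illustration of the proximity force approximation described above, the following Python sketch sums the ideal parallel-plate Casimir pressure over ring-shaped elements of a sphere facing a flat plate, and compares the result with the standard closed-form expression. It assumes perfectly reflecting surfaces at zero temperature (the textbook idealization, not the real gold and silicon of the experiment); the function names, the 50-µm radius and the 150-nm gap are illustrative choices, not values taken from the analysis of Chan et al.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s
C = 2.99792458e8        # speed of light in vacuum, m/s


def casimir_pressure(d):
    """Attractive pressure (Pa) between two ideal parallel mirrors d metres apart."""
    return math.pi ** 2 * HBAR * C / (240 * d ** 4)


def pfa_sphere_plate_force(radius, d):
    """Closed-form PFA force (N) for an ideal sphere and plate, valid for d << radius."""
    return math.pi ** 3 * HBAR * C * radius / (360 * d ** 3)


def pfa_sphere_plate_force_numeric(radius, d, steps=100_000):
    """Sum the plate-plate pressure over thin rings of the sphere (midpoint rule).

    Each ring of radius r sits at a local gap h(r) from the plate and is
    treated as an independent flat patch, which is the PFA assumption.
    """
    dr = radius / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * dr
        gap = d + radius - math.sqrt(radius ** 2 - r ** 2)
        total += 2 * math.pi * r * casimir_pressure(gap) * dr
    return total


if __name__ == "__main__":
    radius, gap = 50e-6, 150e-9  # illustrative values only
    print("plate-plate pressure at 100 nm (Pa):", casimir_pressure(100e-9))
    print("sphere-plate force, closed form (N):", pfa_sphere_plate_force(radius, gap))
    print("sphere-plate force, ring sum (N):", pfa_sphere_plate_force_numeric(radius, gap))
```

Under these assumptions the two sphere-plate estimates agree closely, because the gap is tiny compared with the radius of the sphere.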

The force between the nanostructured sili- 
con plate and the sphere in the experiment by 
Chan et al.’ is greater than would be predicted 
by the proximity force approximation. This 
provides the first experimental evidence that 
Casimir forces are not in general additive, 
despite being weak forces. Explicit theoretical 
calculations for the force in this configura- 
tion predict a difference twice as large as the 
measured discrepancy if the boundaries are 
considered to be perfect reflectors’, but come 
closer to the experimental result when the 
finite reflectivity of gold and silicon is taken 
into account”. 

Why does the proximity force approximation 
correctly predict the Casimir force between a 
sphere and a flat surface, but not for nanostruc- 
tured surfaces? As often in physics, it is a mat- 
ter of differing length scales — even vacuum 
energy is not immune to this rule. The distance 
between the two boundaries (two plates, plate 
and sphere, or sphere and nanostructured sur- 
face) constitutes a characteristic length scale of 
the system. Vacuum fluctuations are reflected 
from both boundaries, and those fluctuations 
with wavelengths similar to the separation of 
the boundaries contribute most to the Casimir 
force. The spheres used to measure Casimir 
forces have radii of about 50 µm, much larger 
than the plate-sphere separations of a few hun- 
dred nanometres over which the Casimir force 
is detectable. The curves of these spheres are 


on a length scale too large to have a noticeable 
effect on the reflection process and the Casimir 
force. But the surfaces used by Chan et al.’ 
have crenellations with periods of 400 nm and 
1 µm, on the same scale as the distance over 
which the Casimir force operates. Their precise 
shape thus has a substantial influence on the 
reflection of the most relevant electromagnetic 
fluctuations and so alters the strength of the 
Casimir force”’. 

Chan et al.” have shown that nanostructures 
give shape to vacuum energy in a detectable 
way, even if micro- or macrostructures do not. 
It now remains to be seen whether an appropri- 
ate choice of materials and surface structures 
can tune the Casimir force ‘at will’ for nanotechnological 
applications. 
IMMUNOLOGY 


Surprising side effects 


Michael J. Bevan and Pamela J. Fink 


A drug that normally suppresses an immune response by trapping 
lymphocytes in lymphoid organs results in the elimination of a chronic viral 
infection when applied at low doses. Why should this be? 


During many immune responses, the circula- 
tion of blood lymphocytes, such as T cells, to 
lymph nodes and back to the bloodstream via 
lymphatic vessels is temporarily shut off. Lym- 
phocytes can still enter the nodes, but their 
egress is curtailed, leading to a dramatic decline 
in the number of these cells in the blood — a 
condition called lymphopenia. On page 894 of 
this issue, Premenko-Lanier et al.¹ report that 
administration of a drug that enhances such 
lymphocyte trapping converts what would be 
a chronic viral infection into one that is rapidly 
cleared by the immune system. The observa- 
tion gives an unexpected twist to the potential 
side effects of this immunosuppressive drug. 


Figure 1| Blocked lymphocyte 
recirculation. During an 
immune response, T cells 
recirculate from the blood 
into the lymph nodes and 
back into the blood again. On 
encountering antigens on an 
antigen-presenting cell (APC), 
T cells that have entered the 
lymph node from the blood 
rapidly divide and differentiate 
into effector cells before re- 
entering the blood. Premenko- 
Lanier et al.' find that FTY720, 
which blocks lymphocyte egress 
from the lymph nodes, somehow 
aids the immune response to 
infection with the clone 13 strain 
of lymphocytic choriomeningitis 
virus. 
Two closely related strains of a mouse pathogen 
— lymphocytic choriomeningitis virus 
— differ dramatically in their course of infec- 
tion. The parent strain, known as Armstrong, 
is rapidly cleared by potent killer T cells and 
an accompanying antibody response. The 
derivative strain, clone 13, which differs from 
Armstrong by only two amino-acid residues, 
likewise induces a strong immune response, 
but the virus manages to escape this response 
and infected mice retain high viral titres in their 
blood and tissues for many months. With time, 
high levels of viral antigen reduce or exhaust 
the functionality of antiviral T cells, includ- 
ing their ability to kill infected cells and 
secrete immune-enhancing molecules called 
cytokines”. 

The impetus for the current work’ came 
when the researchers noticed that, although 
acute infection with Armstrong leads to a 
sharp decline in lymphocyte numbers in the 
blood through efficient trapping in the lym- 
phoid organs, infection with clone 13 virus did 
not cause lymphopenia. Could the failure to 
eliminate clone 13 be due to this difference in 
lymphocyte circulation? 

Lymphocyte egress from lymph nodes is 
controlled by signals from sphingosine-1-phosphate 
(S1P), which acts on G-protein-coupled 
cell-surface receptors. A new class of immuno- 
suppressive pro-drug, typified by FTY720, also 
signals via these receptors, blocking lymphocyte 
egress from lymph nodes™ (Fig. 1). Premenko- 
Lanier et al.’ find that administering low doses 
of this drug during the first few days of clone 13 
infection leads to efficient lymphocyte trapping 
in the nodes and clearance of the virus in seven 
days. Even more remarkably, short-term drug 
treatment of mice that have been persistently 
infected with clone 13 virus for 30 days — and 
consequently whose effector T cells are showing 
signs of exhaustion — leads to viral clearance. 

Although the rationale for applying S1P 
analogues was based on the observation that 
clone 13 infection did not lead to lymphocyte 
trapping, it is not clear that this is the explana- 
tion for the drug’s effect. There are five known 
S1P receptors, and these have wide tissue distribution. 
FTY720 reacts with four of these, 
influencing not only lymphocyte recirculation, 
but also other phenomena such as heart rate 
and vascular integrity’*. Whether the critical 
effect of the drug is on lymphocytes them- 
selves, endothelial cells or another cell type 
is not known. Future analysis of this phenom- 
enon will probably use newer ligands that have 
defined target specificity for S1P receptors. It 
is clear, however, that drug treatment does not 
directly abolish infection in some unsuspected 
way, as results show that drug-induced clear- 
ance requires an intact immune system’. 

Why does clone 13 usually escape the strong 
immune response? Again, the answer is not 
definitely known but may relate to the fact 
that clone 13 has a wider tissue distribution 
than the parent Armstrong virus. It infects 
more cells in the lymph node, such as antigen- 
presenting cells® and reticular cells®, and its 
explosive growth rate may simply overwhelm 
the nascent immune response. One side effect 
of the more widespread infection by clone 13 
may be a disruption of the normal lymphocyte- 
trapping mechanism. It is reasonable to pro- 
pose that enhancing lymphocyte trapping in 
lymph nodes augments the initial response to 
clone 13 infection. In support of this argument, 
the absence of lymph nodes alters the course of 
what would be an acute Armstrong infection 
into a chronic infection’. 

It is harder to imagine how drug treatment 
leads to clearance of an established infection. 
The authors do not state whether the drug 
regimen applied at the chronic stage of clone 
13 infection caused lymphopenia by enhanced 
lymphocyte retention in the lymph nodes. At 
this late time point, lymphocytes specific for 
viral antigens are already showing signs of 
exhaustion and must be rejuvenated. It seems 
likely that FTY720 boosts the response, either 
by enhancing positive regulators of immunity 
or by blocking negative regulators. CD4-type 
T cells are necessary for FTY720’s effects on 
clone 13 infection, and these cells are crucial 
for reversing the dysfunction in antiviral killer 
T cells. Whatever the mechanism for the anti- 
viral effect of FTY720 treatment, Premenko- 
Lanier and colleagues’ work’ serves to put 
clinicians who prescribe this drug on notice to 
expect some unusual side effects. 
AUTISM 


Family connections 


Daniel H. Geschwind 


Autism is a common neurodevelopmental syndrome with a strong genetic 
component. The study of autistic individuals whose parents are cousins 
highlights the genetic diversity of this condition. 


Genetically complex human disorders, such 
as obesity and Alzheimer’s disease, also occur 
in rare inherited forms — that is, a disease- 
causing mutation arises that can be passed on 
to the next generation. Studying such rare 
mutations is a powerful strategy for gain- 
ing insight into the mechanisms of the more 
common forms of a disease’. This is especially 
true of autism, for which most associated gene 
mutations that have been identified are either 
rare or cause syndromic forms of the disor- 
der”®. Reporting in Science, Morrow et al.’ 
describe their study of a large group of highly 
consanguineous families — families in which 
there were many marriages between cousins — 
to search for genes connected with increased 
susceptibility to autism. 

To identify rare, disease-causing, recessively 
inherited mutations in consanguineous fami- 
lies, a technique called homozygosity map- 
ping’ is used. In this technique, the genomes 
of a group of subjects with a certain disorder 
are searched for regions that they have in com- 
mon and that are identical (homozygous) on 
both chromosomes of an affected individual. 
It is assumed that such regions are passed on 
from both parents, who shared a recent com- 
mon ancestor. Homozygosity mapping has 
been successfully used to identify the genes 
responsible for certain inherited diseases, such 
as rare disorders of neural development, but it 
has not been widely applied to more common, 
complex genetic disorders. 
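
The core idea of homozygosity mapping can be sketched in a few lines of Python. This is a toy illustration only: the marker genotypes, the function names and the minimum run length are hypothetical, and real analyses work on dense marker data with statistical thresholds rather than on a simple scan such as this. The sketch looks for stretches of consecutive markers at which every affected relative is homozygous for the same allele.

```python
def is_homozygous(genotype):
    """True when both alleles at a marker are identical, for example 'AA'."""
    return len(genotype) == 2 and genotype[0] == genotype[1]


def shared_homozygous_runs(individuals, min_markers=3):
    """Return (start, end) marker-index pairs, end exclusive.

    A run is a stretch of consecutive markers at which every individual
    is homozygous for the same allele. Runs shorter than min_markers
    are ignored as likely to be shared by chance.
    """
    n_markers = len(individuals[0])
    runs = []
    start = None
    for i in range(n_markers):
        calls = [person[i] for person in individuals]
        shared = all(is_homozygous(g) for g in calls) and len(set(calls)) == 1
        if shared and start is None:
            start = i
        elif not shared and start is not None:
            if i - start >= min_markers:
                runs.append((start, i))
            start = None
    if start is not None and n_markers - start >= min_markers:
        runs.append((start, n_markers))
    return runs


# Hypothetical genotypes for three affected relatives at seven ordered markers.
affected = [
    ["AB", "AA", "GG", "CC", "TT", "AG", "CC"],
    ["AA", "AA", "GG", "CC", "TT", "GG", "CT"],
    ["BB", "AA", "GG", "CC", "TT", "AG", "CC"],
]

if __name__ == "__main__":
    print(shared_homozygous_runs(affected))
```

In this made-up example only markers 1 to 4 form a shared homozygous stretch, so that region would be the candidate interval to examine for a recessive mutation.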

Morrow et al.’ aimed to carry out homozy- 
gosity mapping to identify autism genes in 
children of consanguineous marriages. But 
consanguineous families in which several children 
have autism are hard to come by, and so 
the first challenge was to find and characterize 
such pedigrees. 

The authors therefore established an impres- 
sive collaborative network throughout the 
Middle East and Eurasia (the Homozygosity 
Mapping Collaborative for Autism; HMCA), 
and identified 88 consanguineous families 
with autistic children. (It is common for cousins 
in Middle Eastern families to marry, and fami- 
lies tend to be large, so the search for inherited 
factors is that much easier.) The authors 
scanned the genomes of their subjects at high 
resolution for structural chromosomal anom- 
alies such as deletions or duplications (also 
called copy-number variations, or CNVs). 

Previous work*” has found that individuals 
with sporadic (non-familial) autism have a 
higher prevalence of de novo CNVs (those not 
observed in parents) than do those with inheri- 
ted autism. Morrow et al. do not find such an 
increase in de novo CNVs in their subjects, 
but they do find an increase in inherited 
CNVs in HMCA families. This, along with 
a low level of de novo events, is consistent 
with inherited, rather than sporadic, autism in 
these families. 

The authors also identify six chromosomal 
regions in the HMCA subjects that harbour 
inherited, homozygous deletions ranging 
from 18 to 880 kilobases in size. The largest 
of these regions removes a gene of unknown 
function called c3orf58, and disrupts the promoter 
sequence of another gene, NHE9, which 
encodes a cell-membrane protein. The authors 


NATURE|Vol 454|14 August 2008 


NEWS & VIEWS 


compared the sequence of NHE9 in several 
hundred autistic subjects from non-consan- 
guineous families and controls, and show 
that single base-pair mutations in the gene are 
associated with autism. Their finding therefore 
further supports a role for NHE9 in increasing 
the susceptibility to this disorder. 

The symptoms of autism include commu- 
nication and social deficits, and restricted and 
repetitive behaviour. Clinical features can vary 
widely between individuals, and may include 
epilepsy or developmental regression after a 
period of apparently normal development. To 
simplify their analysis of NHE9, Morrow and 
colleagues focused on a subset of individuals 
who suffered from autism-associated epilepsy. 
It is thought that a subset of individuals who 
also have a particular additional disease feature 
(phenotype) in common might form a more 
genetically similar group. Unlike most of the 
autistic subjects without epilepsy, these indi- 
viduals showed a more than fivefold increase 
in potentially harmful base-pair changes in the 
NHE9 gene sequence, including a stop codon. 
Therefore, NHE9 may be added to the list of 
genes that are associated with certain subsets 
of autism®. 

Morrow and co-workers’ observations’ 
raise some intriguing questions. Five of the 
six homozygous deletions identified in the 
HMCA subjects lie in regions outside the cod- 
ing sequence of the implicated genes, and so 
are unlikely to disrupt the encoded protein 
sequence. Several of the deletions might dis- 
rupt presumptive regulatory regions, whereas 
another 320-kilobase deletion closest to the 
PCDH10 gene, which encodes the PCD10 
(protocadherin 10) protein, does not lie in a 
coding region or in a sequence of known func- 
tion. The effect of these deletions on neigh- 
bouring genes, and any causal role in autism, 
therefore require further investigation. Also, 
the causal nature of the NHE9 mutations, which 
result in potentially deleterious changes at the 
protein level, must be established, because 
similar changes are observed in control 
subjects, albeit less frequently. 

The difficulty in identifying mutations that 
are definitely pathogenic is an emerging theme 
in common neuropsychiatric disorders. Mor- 
row and colleagues’ study demonstrates that 
studying consanguineous families is a fruitful 
path to identifying autism genes, but also that 
such analysis is more challenging than might 
initially have been expected. This is partly due 
to the underlying genetic heterogeneity of 
autism, a facet underscored by this study. The 
authors find that no single recessive mutation 
is associated with autism in more than one 
family, even in consanguineous families from 
the same geographic region. 

Luckily, advances in DNA sequencing tech- 
nology mean that even large genomic regions 
can be efficiently sequenced, and work to 
identify recessive mutations in single base 
pairs within the genomes of subjects from the 
HMCA group might already be under way. 


Sequencing shared regions in these families 
could be even more informative because many 
mutations in these regions are probably small 
— perhaps single base-pair mutations — and 
thus might more directly implicate specific 
genes. 

Convergent evidence from other studies, 
including functional and gene-expression data, 
will be crucial in assessing the relevance of 
candidate genes to autism'®. Morrow et al. 
embrace this notion, as their putative autism 
susceptibility genes represent diverse func- 
tional classes, which at face value do not suggest 
common molecular mechanisms. To provide 
a potential functional context, they show that 
three of the genes — either near (PCDH10) 
or overlapping (c3orf58 and NHE9) the 
deletions — are among a significant subset 
that is associated with a form of neuronal 
activity thought to be involved in learning 
and memory”. 

The authors suggest that investigating such 
genes might provide a way of unifying seem- 
ingly diverse molecular findings in autism. 
Given the nearly two dozen genes that are now 
associated with the disorder’, this should be a 
readily testable hypothesis. The way in which 
functional data relate to the core deficits of 
autism is not known, because many aspects of 
learning and memory are preserved in autis- 
tic children. The idea that autism might rep- 
resent a general deficit in neuronal plasticity 
caused by changes in gene expression needs to 
be reconciled with the specificity of cognitive 
dysfunction observed in the condition’, as 
well as the preservation of general intellectual 
ability in many individuals. This aspect high- 
lights a central challenge of research in this 
area: connecting basic molecular pathways 
to the properties of complex neural circuits 
and cognition in humans. Spanning these 
diverse levels of function will require detailed 
analysis of the correlation between the genetic 
composition of individuals and their associ- 
ated phenotype across the spectrum of autism 
disorders. 

NEUROSCIENCE 


State-sanctioned synchrony 


Scott J. Cruikshank and Barry W. Connors 


A sleepy brain pays little attention to its surroundings, and its neurons 
are lulled by a common oscillation. As the brain swiftly rouses from this 
sluggish state, its neurons function more independently. 


You know the experience. One moment you 
are drowsy and unfocused, awake but oblivi- 
ous to your environment. Suddenly, something 
snaps you into awareness. Whether the trigger 
is external, like the sight or sound of someone 
entering the room, or internal, such as the 
thought of a looming deadline, you become 
immediately alert. Your ability to deal with 
most tasks, not just the stimulus that aroused 
you, improves, suggesting a change in your 
overall ‘brain state’. 

Understanding the neural mechanisms of 
brain states has been a goal of neuroscientists 
for decades'*. One long-lived suggestion is 
that brain states relate to the degree to which 
neuronal activity in the cerebral cortex is syn- 
chronized‘. On page 881 of this issue, Poulet 
and Petersen’ directly address this issue, showing, 
in awake mice, that the electrical activity 
of adjacent neurons becomes less synchronous 
when the brain enters an active state. 

The long whiskers on a rodent’s snout are 
important sensory organs. When mice are on 
the go, they sweep their whiskers through the 
local environment to feel and identify objects, 
and to navigate in darkness°. Poulet and 
Petersen’ studied brain activity in mice during 
periods of active ‘whisking’, when the animals 
rhythmically waved their whiskers back and 
forth, and compared this with periods of ‘quiet 
wakefulness’, when the mice (and their whiskers) 
were almost motionless (Fig. 1a). The 
authors managed the technical feat of making 
simultaneous intracellular recordings from 
pairs of cortical neurons in awake, unanaes- 
thetized animals. Intracellular recordings allow 
measurement not only of action potentials (the 
‘firing’ of neurons, signalling their output), but 
also of smaller electrical fluctuations, which 
mainly reflect inputs from other nerve cells. 
Poulet and Petersen’ found that these sub- 
threshold fluctuations are highly correlated 
among cortical cells during quiet wakefulness 
and relatively desynchronized during whisk- 
ing (Fig. 1b). 
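
The idea of state-dependent synchrony can be sketched numerically. The Python snippet below is an illustration only, not the analysis used by Poulet and Petersen: it assumes that each neuron's subthreshold membrane potential is a shared slow oscillation plus independent noise, and it measures synchrony as the Pearson correlation between the two traces. The function names, the 3-hertz oscillation, the noise level and the amplitudes are all hypothetical choices made for the example.

```python
import math
import random


def simulate_pair(shared_amplitude, n_samples=5000, dt=0.001, freq_hz=3.0, seed=0):
    """Return two toy membrane-potential traces (arbitrary units).

    Assumption for illustration: each trace is a common slow oscillation
    (the shared input) plus independent Gaussian noise (the private input).
    """
    rng = random.Random(seed)
    v1, v2 = [], []
    for i in range(n_samples):
        shared = shared_amplitude * math.sin(2 * math.pi * freq_hz * i * dt)
        v1.append(shared + rng.gauss(0.0, 1.0))
        v2.append(shared + rng.gauss(0.0, 1.0))
    return v1, v2


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical 'quiet' state: strong shared slow oscillation.
quiet_r = pearson(*simulate_pair(shared_amplitude=3.0))
# Hypothetical 'whisking' state: the shared oscillation is much weaker.
active_r = pearson(*simulate_pair(shared_amplitude=0.3))

print(f"quiet: r = {quiet_r:.2f}, whisking: r = {active_r:.2f}")
```

In this toy model the correlation is high when the shared oscillation dominates and falls towards zero when the independent inputs dominate, which is the qualitative pattern described above. The model makes no claim about where the shared input comes from.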

A correspondence between quiet brain 
states and strong cortical synchrony was the 
mainstream view of neuroscientists from the 
earliest days of electroencephalography (EEG, 
a method for recording extracellular voltages 
from the surface of the skull) until about 
20 years ago. EEG pioneers referred to the large 
slow-waves associated with restful behaviour 
as ‘synchronized’. Signals from individual 
neurons are tiny, so it was believed that syn- 
chronous activity of many neurons would be 
needed to give large measurable signals at EEG 
electrodes located outside the skull*”. More 
recently, simultaneous EEG and intracellular 
recordings within the cortex during quiescent 
states (anaesthesia, sleep and quiet wakeful- 
ness) have demonstrated that slow oscilla- 
tions of intracellular membrane potentials 
correlate well with EEG oscillations. More- 
over, when brain states become active, either 
spontaneously or after stimulation, slow EEG 
waves and membrane-potential fluctuations 
diminish simultaneously*"”. 

Because the EEG and the membrane poten- 
tials of most cortical neurons are strongly 
correlated, it seems very likely that slow mem- 
brane-potential oscillations are synchronized 
among cortical cells during quiescent states. 
In fact, synchrony among pairs of neurons has 
already been directly demonstrated in anaes- 
thetized animals'’’”. However, anaesthetics 
have powerful and poorly understood effects 
on neural circuits, and the cellular properties 
of the normally behaving brain have been par- 
ticularly difficult to measure. This is why the 
work of Poulet and Petersen, measuring mem- 
brane potentials from pairs of cortical neurons 
in awake animals, is so important. 

The decrease in membrane-potential cor- 
relation, observed when mice switched from 
quiet wakefulness to active whisking, was 
mainly due to decreased synchrony of low- 
frequency oscillations. During whisking, there 
was no obvious increase in relatively high- 
frequency (30-100 hertz) oscillations known 
as ‘gamma’ rhythms, or correlations of the 
same. This seems at odds with several previ- 
ous studies of cortical activation (including one 
from the Petersen group’). In fact, since the 
late 1980s, an extensive literature has reported 
that activated brain states are accompanied by 
enhancement of gamma rhythms and their 
synchrony’*’*. 

The absence of enhanced gamma rhythms in the current study may reflect
the ambiguous nature of the whisking state in these experiments. EEG
measurements suggest that whisking periods might correspond to relatively
active brain states. For instance, EEG fluctuations during quiet
wakefulness have relatively high amplitudes and low frequencies (features
classically associated with passive brain states), whereas the
fluctuations during whisking tend to be smaller and more irregular’
(features of active states). However, the heads of the mice had to be
fixed in position for the duration of Poulet and Petersen’s experiments’.
No stimuli were provided, and the whisking apparently began and ended
spontaneously, making the behavioural significance of this ‘active’ state
uncertain®.

Figure 1 | Brain state and cortical synchrony. a, Quiet wakeful states of
the mouse brain lack whisker movements; active brain states are indicated
by whisking. b, Simultaneous recordings from adjacent cortical neurons
(pictured in c) show synchronous oscillations in membrane potential
during quiet states. This synchrony breaks down during active (whisking)
states. c, The synchronous, low-frequency cortical activity is driven by
a neural ‘pacemaker’ (circle). The strength of this pacemaker is
regulated by brain state (stronger during quiet wakefulness, weaker
during whisking). Specific, information-rich pathways (various colours)
generate relatively high-frequency, asynchronous synaptic inputs to
neurons that dominate during active brain states.

Poulet and Petersen’ also show that elimi- 
nating sensory feedback from the whisker 
follicles, by severing the infraorbital nerve, 
affects neither state transitions themselves nor 
state-dependent changes in synchrony, even 
though all recordings are from the area of 
the cortex that processes this type of sensory 
information. The synaptic drives that mediate 
low-frequency synchrony are thus triggered 
centrally rather than by the direct action of the 
sensory input on the cortex (Fig. 1c), consistent 
with much earlier studies””’. However, Poulet 
and Petersen did not report whether the acti- 
vation is cortex-wide (for example, including 
visual, auditory, prefrontal and limbic areas) or 
limited to whisker-processing regions. Some 
studies have shown that specific movements 
can be correlated with the localized activa- 
tion of cortical areas associated with these 
motions’*. Something similar may be happen- 
ing here because the active state was defined 
by whisker movement itself. There may well be 
important mechanistic differences between the 
sort of activation produced by general ‘arousal’ 
systems of the brain?*"””, and activation medi- 
ated by specific processes such as sensory- 
motor coordination. 

This raises the issue of mechanisms more 
generally. Which neuromodulatory systems, 
if any, are involved in the state transitions? 
Where do the signals driving membrane- 
potential synchrony originate, and to what 
extent are they mediated by state-dependent 
excitatory or inhibitory processes'*? And how 
do fluctuations in synchrony affect sensory 
processing’? 

The relationship between brain state and 
cortical synchrony has been investigated for 
at least 80 years. This week’s study by Poulet 
and Petersen’ used direct measurements in 
a field that had been dominated by oblique 
techniques. This may, to paraphrase Winston 
Churchill, mark ‘the end of the beginning’ of 
an epic scientific problem. 

Scott J. Cruikshank and Barry W. Connors are 
in the Department of Neuroscience, Brown 
University, Providence, Rhode Island 02912, USA. 
e-mails: scott_cruikshank@brown.edu; 
barry_connors@brown.edu 

Genomics of cellulosic biofuels 


Edward M. Rubin¹,² 


The development of alternatives to fossil fuels as an energy source is an urgent global priority. Cellulosic biomass has the 
potential to contribute to meeting the demand for liquid fuel, but land-use requirements and process inefficiencies represent 
hurdles for large-scale deployment of biomass-to-biofuel technologies. Genomic information gathered from across the 
biosphere, including potential energy crops and microorganisms able to break down biomass, will be vital for improving the 
prospects of significant cellulosic biofuel production. 


The capture of solar energy through photosynthesis is a process 
that enables the storage of energy in the form of cell wall 
polymers (that is, cellulose, hemicellulose and lignin). The 
energy stored in these polymers can be accessed in a variety 
of ways, ranging from simple burning to complex bioconversion 
processes. The high energy content and portability of biologically 
derived fuels, and their significant compatibility with existing petroleum-based 
transportation infrastructure, help to explain their 
attractiveness as a fuel source. Despite the increasing use of biofuels 
such as biodiesel and sugar- or starch-based ethanol, evidence sug- 
gests that transportation fuels based on lignocellulosic biomass rep- 
resent the most scalable alternative fuel source’. Lignocellulosic 
biomass in the form of plant materials (for example, grasses, wood 
and crop residues) offers the possibility of a renewable, geograph- 
ically distributed and relatively greenhouse-gas-favourable source of 
sugars that can be converted to ethanol and other liquid fuels. 
Calculations of the productivity of lignocellulosic feedstocks, in part 
based on their ability to grow on marginal agricultural land, indicate 
that they can probably have a large impact on transportation needs 
without significantly compromising the land needed for food crop 
production’. 

Lignocellulosic biofuel production involves collection of biomass, 
deconstruction of cell wall polymers into component sugars (pre- 
treatment and saccharification), and conversion of the sugars to 
biofuels (fermentation) (Fig. 1). Partially because of the historically 
low demand for biologically based transportation fuels, each step in 
this process is in the early stages of optimization for efficiency and 
throughput. The crops from which biomass is currently derived have 
not been domesticated for this particular purpose and the present 
methods for saccharification and fermentation are inefficient and 
expensive. However, the recent and pressing desire to develop alternatives 
to fossil fuels has made the rapid improvement of biofuel 
production a high priority, in which biologically derived energy 
(‘bioenergy’)-relevant genomic insights and resources will have an 
important role (Table 1). 
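
As a rough guide to the best that the saccharification and fermentation steps could deliver, the theoretical ethanol yield can be worked out from stoichiometry. The sketch below is not taken from this article: it assumes complete hydrolysis of cellulose (one water molecule added per anhydroglucose unit, C6H10O5) and the standard fermentation reaction C6H12O6 → 2 C2H5OH + 2 CO2, using rounded atomic masses. The names in the code are illustrative.

```python
# Rounded standard atomic masses in g/mol.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def molar_mass(formula):
    """Molar mass (g/mol) of a compound given as {element: atom count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


GLUCOSE = {"C": 6, "H": 12, "O": 6}          # free sugar after hydrolysis
ANHYDROGLUCOSE = {"C": 6, "H": 10, "O": 5}   # repeating unit within cellulose
ETHANOL = {"C": 2, "H": 6, "O": 1}

# Each glucose yields at most two ethanol molecules (plus two CO2).
ethanol_per_glucose = 2 * molar_mass(ETHANOL) / molar_mass(GLUCOSE)
# Each anhydroglucose unit becomes one glucose on complete hydrolysis.
ethanol_per_cellulose = 2 * molar_mass(ETHANOL) / molar_mass(ANHYDROGLUCOSE)

print(f"g ethanol per g glucose:   {ethanol_per_glucose:.3f}")
print(f"g ethanol per g cellulose: {ethanol_per_cellulose:.3f}")
```

These figures are upper bounds of roughly half the sugar mass. Real processes fall short of them because deconstruction is incomplete and the fermenting microorganisms divert some sugar to growth and by-products.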


Biomass 


From the perspective of transportation fuels, plants can be viewed as 
solar energy collectors and thermochemical energy storage systems. It 
is the storage of energy in a form that can later be accessed via 
thermochemical or enzymatic conversion that distinguishes biomass 
from other renewable energy sources. Cellulosic biomass, sometimes 
referred to as lignocellulosic biomass, is an abundant renewable 
resource that can be used for the production of alternative trans- 
portation fuels’. The three main components of lignocellulose are 
cellulose, hemicellulose and lignin (Fig. 2), with the relative propor- 
tions of the three dependent on the material source’. Cellulose, the 
main structural component of plant cell walls, is a long chain of 
glucose molecules, linked to one another primarily by glycosidic 
bonds’. Hemicellulose, the second most abundant constituent of 
lignocellulosic biomass, is not a chemically well defined compound 
but rather a family of polysaccharides, composed of different 5- and 
6-carbon monosaccharide units, that links cellulose fibres into 
microfibrils and cross-links with lignin, creating a complex network 
of bonds that provide structural strength’. Finally lignin, a three- 
dimensional polymer of phenylpropanoid units, can be considered 
as the cellular glue providing the plant tissue and the individual fibres 
with compressive strength and the cell wall with stiffness®, in addition 
to providing resistance to insects and pathogens. 


Figure 1 | Biology of bioconversion of solar energy into biofuels. Solar 
energy is collected by plants via photosynthesis and stored as 
lignocellulose. Decomposition of the cellulosic material into simple 
5- and 6-carbon sugars is achieved by physical and chemical pre-treatment, 
followed by exposure to enzymes from biomass-degrading organisms. The 
simple sugars can be subsequently converted into fuels by microorganisms. 


¹DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, California 94598, USA. ²Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA. 

Organism                                                          Genome size (megabases)  Status       Reference

Feedstocks and feedstock models
Populus trichocarpa (poplar)                                      480                      Complete     Ref. 9
Chlamydomonas reinhardtii                                         120                      Complete     Ref. 34
Glycine max (soya bean)                                           1,200                    Draft        -
Manihot esculenta (cassava)                                       770                      In progress  -
Sorghum bicolor                                                   760                      In progress  -
Eucalyptus globulus                                               600                      In progress  -
Brachypodium distachyon                                           355                      In progress  -
Zea mays (maize)                                                  2,500                    In progress  -
Elaeis guineensis (oil palm)                                      ~3,400                   In progress  http://www.checkbiotech.org/green_News_Biofuels.aspx?infold=15100
Panicum virgatum (switchgrass)                                    ~5,600                   In progress  -
Setaria italica (foxtail millet)                                  ~515                     In progress  -

Biomass degraders
Acidothermus cellulolyticus 11B                                   2.4                      Complete     -
Bacillus pumilus SAFR-032                                         ae                       Complete     Ref. 35
Caldicellulosiruptor saccharolyticus DSM 8903                     3.0                      Complete     -
Clostridium phytofermentans ISDg                                  4.8                      Complete     -
Clostridium thermocellum ATCC 27405                               3.8                      Complete     -
Cytophaga hutchinsonii ATCC 33406                                 4.4                      Complete     -
Flavobacterium johnsoniae UW101                                   lel                      Complete     -
Rubrobacter xylanophilus DSM9941                                  3.2                      Complete     -
Saccharophagus degradans                                          Sail                     Complete     Ref. 36
Thermobifida fusca strain YX                                      3.6                      Complete     Ref. 37
Clostridium cellulolyticum H10                                    4.0                      Draft        -
Elusimicrobium minutum Pei191                                     1.6                      Draft        -
Nectria haematococca/Fusarium solani                              il                       Draft        -
Phanerochaete chrysosporium                                       Bion!                    Draft        -
Postia placenta                                                   33                       Draft        -
Sagittula stellata E-37                                           oo                       Draft        -
Trichoderma reesei/Hypocrea jecorina                              33                       Draft        -
Cellulomonas flavigena DSM 20109                                  ~4.0                     In progress  -
Cellvibrio japonicus Ueda107                                      ~6.0                     In progress  -
Fibrobacter succinogenes subsp. succinogenes S85                  ~3.8                     In progress  -
Ruminococcus albus                                                4.0                      In progress  -
Teredinibacter turnerae T7902                                     2                        In progress  -
Termite hindgut community                                         NA                       Complete     Ref. 23
Poplar biomass degrading community                                NA                       In progress  http://www.jgi.doe.gov/sequencing/Ispssseqplans2007.html
Asian longhorned beetle (Anoplophora glabripennis) gut community  NA                       In progress  http://www.jgi.doe.gov/sequencing/DOEmicrobes2007.html
Bovine rumen community transcriptome                              NA                       In progress  http://www.energybiosciencesinstitute.org/index.php?option=com_content&task=view&id=159&ltemid=20

Fuel producers
Clostridium acetobutylicum ATCC 824                               4.0                      Complete     Ref. 38
Clostridium beijerinckii NCIMB 8052                               6.0                      Complete     -
Pichia stipitis                                                   15.4                     Complete     Ref. 27
Thermoanaerobacter tengcongensis MB4                              2.7                      Complete     Ref. 39
Zymomonas mobilis subsp. mobilis ZM4                              2.1                      Complete     Ref. 40
Bacillus coagulans 36D1                                           txS)                     Draft        -
Thermoanaerobacter pseudethanolicus 39E                           2.4                      Draft        -
Clostridium ljungdahlii                                           ~4.0                     In progress  -

Bioenergy-relevant organisms for which large-scale genome projects have been completed or are under way are listed. 
Information on genome projects without references can be found at http://www.ncbi.nlm.nih.gov/sites/entrez?db=genomeprj. 


As we can retrospectively view the features that made certain wild 
plants desirable for domestication thousands of years ago to become 
today’s food crops, we are now prospectively defining criteria to 
choose plants with potential to serve as dedicated bioenergy crops 
in the future. These include cell wall composition, growth rate, suit- 
ability for growth in different geographical regions, and resource-use 
efficiencies. With these features in mind, a list of potential bioenergy 
crops is being developed and targeted for different growing conditions’. 
Most plants assimilate their CO₂ first into a C₃ compound, 
whereas a smaller subset use a C₄ compound. Plants using C₄ photosynthesis 
tend to be among the most productive, having higher 
maximum efficiencies of light, nitrogen and water use in assimilating 
carbon. The C₄ group of potential energy crops includes various 
perennial grasses such as switchgrass and Miscanthus. These grasses 
have the advantages of not requiring replanting after a yearly harvest, 
rapid growth, high biomass density per unit area, and low nutrient 
and water needs, enabling growth on marginal agricultural land. 
Disadvantages are that C₄ plants are rare in cold climates and unable 
to grow at temperatures below 10°C. In these environments, trees, 
which exclusively depend on C₃ photosynthesis, provide the only 
candidate species. The C₃ group of potential energy crops includes 
trees, such as poplar and eucalyptus, which have relatively rapid 
growth potential in difficult-to-plough environments. It is highly 
likely that multiple different energy feedstocks will be deployed 
depending on latitude, geography, water availability and landowner 
acceptance. 

Until recently, minimal effort has been directed towards optim- 
izing potential energy crops for the generation of transportation 
fuels. This is in stark contrast to the agronomic development of food 
crops, which have been domesticated for thousands of years to max- 
imize productivity. Teosinte, the wild precursor to modern maize, 
was first recognized by Native Americans more than 5,000 years ago 
as a potential food crop. The domestication of teosinte resulted in its 
conversion from a wild plant, the characteristics of which had been 
orchestrated by natural selection maximizing survival and reproduc- 
tion, into a plant whose morphology and physiology had been exten- 
sively altered by artificial selection to increase its nutritional yield and 
ease of harvest*. More recently, selective breeding as well as agro- 
nomic advances have resulted in improvement over several orders 
of magnitude in the nutritional value per acre of modern maize 
compared to that of teosinte. Some of the most rapid increases have 
occurred in the past 40 years, both from advances in agronomic 
practices and, importantly, from the application of modern genetics. 
The optimization of bioenergy crops as feedstocks for transportation 
fuels is in its infancy, but already genomic information and resources 
are being developed that will be essential for accelerating their 
domestication. Many of the traits targeted for optimization in potential 
cellulosic energy crops are those that would improve growth on 
poor agricultural lands, to minimize competition with food crops 
over land use. 

Populus trichocarpa (poplar), the first tree and potential bioenergy 
crop to have its genome sequenced (Table 1)⁹, illustrates some of the 
issues and potential of applying genomics to the challenge of optimizing 
energy crops. The traits for which the genetic underpinnings 
will be sought in the genomes of bioenergy-relevant plants, such as 
poplar, include those affecting growth rates, response to competition 
for light, branching habit, stem thickness and cell wall chemistry. 
Significant effort will go into maximizing biomass yield per unit land 
area, because this more than any other factor will minimize the 
impact on overall land use. One can imagine trees optimized to have 
short stature to increase light access and enable dense growth, large 
stem diameter, and reduced branch count to maximize energy density 
for transport and processing. Trees have evolved with highly rigid 
and stable cell walls due to heavy selective pressure for long life and an 
upright habit. Plants domesticated for energy production, with a 
crop cycle time of only a few years, would have less need for a rigid 
cell wall than wild plants with lifetimes of a hundred years or more. 
Alterations in the ratios and structures of the various macromolecules 
forming the cell wall are a major target in energy crop domestication 
to facilitate post-harvest deconstruction at the cost of a less 
rigid plant. 

Figure 2 | Structure of lignocellulose. The main component of 
lignocellulose is cellulose, a β(1–4)-linked chain of glucose molecules. 
Hydrogen bonds between different layers of the polysaccharides contribute 
to the resistance of crystalline cellulose to degradation. Hemicellulose, the 
second most abundant component of lignocellulose, is composed of various 
5- and 6-carbon sugars such as arabinose, galactose, glucose, mannose and 
xylose. Lignin is composed of three major phenolic components, namely 
p-coumaryl alcohol (H), coniferyl alcohol (G) and sinapyl alcohol (S). Lignin 
is synthesized by polymerization of these components and their ratio within 
the polymer varies between different plants, wood tissues and cell wall layers. 
Cellulose, hemicellulose and lignin form structures called microfibrils, 
which are organized into macrofibrils that mediate structural stability in the 
plant cell wall. 

Already, comparison of several of the presently available plant 
genomes (poplar, rice, Arabidopsis; see Table 1), coupled with 
large-scale plant gene function and expression studies, has identified 
a number of candidate genes for domestication traits. These 
include many genes involved in cellulose and hemicellulose synthesis 
as well as those believed to influence various morphological growth 
characteristics such as height, branch number and stem thickness. 
In addition to homology-based strategies, other genome-enabled 
strategies for identifying domestication candidate genes are being 
used. These include quantitative trait analysis of natural variation 
and genome-wide mutagenesis coupled with phenotypic screens 
for traits such as recalcitrance to sugar release, acid digestibility 
and general cell wall composition. The availability of high-throughput 
transgenesis in several plant systems will facilitate functional 
studies to determine the in vivo activities of the large number of 
domestication candidate genes. Using these strategies, genes affecting 
plant height, stem elongation and trunk radial growth, drought 
tolerance and cell wall stability are but a few of those likely to be 
identified as targets for domestication in a fraction of the time 
required to carry out similar studies unaided by the plant genomes 
and genomic approaches. 


Biomass degradation 


The breakdown of biomass involves the release of long-chain poly- 
saccharides, specifically cellulose and hemicellulose, and the sub- 
sequent hydrolysis of these polysaccharides into their component 
5- and 6-carbon sugars. Early chemical processes for cellulose 
degradation depended heavily on acid treatments, and even today 
industrial processes for biomass degradation involve heat and acidic 
conditions and tend to be expensive, slow and relatively inefficient. 
Furthermore, some of these pre-treatments produce inhibitors (that 
is, furfural and 5-hydroxymethylfurfural) that decrease the overall 
yield of the fermentation process. 

The human need for efficient breakdown of lignocellulosic bio- 
mass for biofuel production is relatively new; however, a variety of 
organisms have evolved to take advantage of this nutrient source, 
including the free-living organisms and symbiotic animal—microbe 
consortia invariably present in biomass-rich environments. 
Increasing our knowledge of the biochemical machinery used by 
these organisms for the breakdown of biomass offers new avenues 
for the development of biologically based processes that could poten- 
tially accomplish biomass conversion at an industrial scale. Just as 
plant genomes are providing us with new approaches for accelerating 
feedstock domestication, genomics of biomass-degrading microbes 
and microbial consortia offers a new route to enzyme discovery. 

Microbial strategies for degrading lignocellulose are diverse, yet 
our current understanding of the enzymes involved in these processes 
is limited to a handful of model organisms such as the fungus 
Trichoderma reesei and the bacterium Clostridium thermocellum. 
Recent genome sequencing of these two biomass degraders, as well 
as a host of other cellulolytic species (Table 1), has expanded our 
repertoire of known or potential cellulolytic enzymes. Knowledge 
of biomass degradation pathways should soon increase further as a 
number of large-scale genomic surveys, now under way, examine isolated 
organisms and microbial communities known to degrade biomass 
(Table 1). 

Considering the current dependence on acid and heat pre-treat- 
ment in the deconstruction of lignocellulose, it is clear that enzymes 
that are stable and active at low pH values and at high temperatures 
are of particular value. Thus, enzymes derived from thermophilic and 
acidophilic organisms known to degrade lignocellulose, such as 
Caldicellulosiruptor saccharolyticus and Acidothermus cellulolyticus, 
hold significant promise for industrial processes. However, the 
number of sequenced thermoacidophilic organisms is low, partly 
because of the significant obstacles in cultivating these organisms 
in the laboratory. 

Many novel enzymes and enzyme systems that have evolved to 
make use of cellulosic biomass are present in difficult-to-culture 
microbes. This is particularly the case for the communities of 
microbes inhabiting the guts of lignocellulose-consuming insects 
and ruminants. Recently, in response to the challenge of studying 
difficult-to-culture organisms, metagenomic approaches have been 
developed to access the information encoded in their genomes 
(Table 1). This typically involves the high-throughput sequencing 
of DNA extracted directly from the mixtures of organisms present 
in an environment. 

The metagenomic analysis of a wood-feeding higher termite has 
recently revealed hundreds of genes encoding carbohydrate-active 
enzymes, many of which were subsequently shown to be biochemically 
active. From the perspective of an industrial application, the termite 
hindgut is too complex a model to mimic because it comprises hun- 
dreds of microbial species and lignocellulose-degrading enzymes. 
However, termites tend to feed on complex diets of several plant 
species, which translate to more complex hindgut communities and 
enzyme inventories. Thus, examination of this single community 
reveals enzymes capable of hydrolysing a broad assortment of chemical 
bonds. The saccharification of a single uniform cellulosic feedstock, 
particularly one engineered to have a simplified cell wall, will presum- 
ably require fewer enzymes and less downstream optimization of gene 
regulation and enzyme activity in a production host. 

The availability of a wide range of naturally occurring lignocellu- 
lose-degrading enzymes increases the chances of successful enzyme 
optimization for industrial processes. Optimization of the sacchar- 
ification process is crucial because the cost of cellulases remains a key 
barrier to economical production of biofuels. A more diverse set of 
candidate enzymes—identified through a combination of conven- 
tional cultured microbial genome studies coupled with envir- 
onmental prospecting methods such as metagenomics—improves 
the likelihood of obtaining enzymes with activities and stability sui- 
ted to a variety of industrial processes. 


Biofuel production 


Although the final steps of cellulosic ethanol production will require 
much of the same infrastructure developed for the production of 
sugar- and starch-based ethanol, changes will need to be made to 
exploit the diversity of sugars generated from the breakdown of bio- 
mass. Whereas the conversion of starch-based feedstock results prim- 
arily in hexoses, the products from degradation of lignocellulosic 
biomass, composed in part of hemicellulose, include large amounts 
of the pentose sugars D-xylose and L-arabinose. In contrast to the 
hexose sugars, the pentose sugars cannot be fermented by wild-type 
Saccharomyces cerevisiae. Another factor limiting ethanol yield is 
the toxicity of ethanol to the fermenting host. Most fermenting 
organisms such as S. cerevisiae cannot tolerate ethanol concentrations 
exceeding 25% (v/v), resulting in a product that must then be 
concentrated through distillation. Distillation represents an expensive 
and energy-intensive step in ethanol production. 
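
As a rough illustration of why pentose fermentation matters for overall yield, the following Python sketch computes the theoretical maximum mass yield of ethanol from a hexose (glucose) and a pentose (xylose). The balanced fermentation equations and the rounded molar masses are standard textbook values supplied here as assumptions; they are not taken from this article, and the sugar mix in the example is purely hypothetical.

```python
# Theoretical (stoichiometric) ethanol yields; a sketch, not a process model.
# Assumed textbook stoichiometry (not from the article):
#   hexose:  C6H12O6   -> 2 C2H5OH + 2 CO2
#   pentose: 3 C5H10O5 -> 5 C2H5OH + 5 CO2

MOLAR_MASS = {  # g/mol, rounded
    "glucose": 180.16,
    "xylose": 150.13,
    "ethanol": 46.07,
}

# (moles of sugar consumed, moles of ethanol formed)
STOICHIOMETRY = {
    "glucose": (1, 2),
    "xylose": (3, 5),
}


def theoretical_yield(sugar: str) -> float:
    """Return grams of ethanol per gram of sugar at 100% conversion."""
    n_sugar, n_ethanol = STOICHIOMETRY[sugar]
    return (n_ethanol * MOLAR_MASS["ethanol"]) / (n_sugar * MOLAR_MASS[sugar])


def ethanol_from_mix(grams_by_sugar: dict, fermentable: set) -> float:
    """Grams of ethanol from a sugar mix, counting only fermentable sugars."""
    return sum(
        grams * theoretical_yield(sugar)
        for sugar, grams in grams_by_sugar.items()
        if sugar in fermentable
    )


if __name__ == "__main__":
    mix = {"glucose": 60.0, "xylose": 40.0}  # hypothetical hydrolysate, grams
    hexose_only = ethanol_from_mix(mix, {"glucose"})
    both = ethanol_from_mix(mix, {"glucose", "xylose"})
    print(f"hexose-only host: {hexose_only:.1f} g ethanol")
    print(f"hexose + pentose host: {both:.1f} g ethanol")
```

Both sugars give about 0.51 g of ethanol per gram at the stoichiometric limit, so a host that leaves the pentose fraction unfermented forgoes ethanol roughly in proportion to that fraction.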

These limitations in ethanol production have generated significant 
interest in developing new organisms able to exploit fully the 
breakdown products of lignocellulosic biomass as well as tolerate the 
products of fermentation. Pichia stipitis represents one yeast species of 
relevance to biofuel research based on its natural ability to ferment 
the pentose sugar xylose. Its recently sequenced genome revealed 
insights into the metabolic pathways responsible for this process, 
and investigators are already working on the optimization of this 
pathway in P. stipitis as well as on the construction of systems for 
the heterologous expression of P. stipitis genes. Escherichia coli has 
already been genetically engineered for the conversion of all hexose 
and pentose sugars present in hemicellulose polymers. The resulting 
genetically engineered strain has an ethanol production rate similar 
to that of yeast, and its ethanol tolerance has been increased by 
selection on enrichment media. In the future, genomics and pathway 
engineering should considerably facilitate the development of a 
variety of organisms able to use the full repertoire of cellulosic and 
hemicellulosic sugars and tolerate high ethanol concentrations to 
optimize ethanol yields. 

Although ethanol production facilities and distribution centres are 
rapidly increasing in number, ethanol as a transportation fuel has 
several disadvantages that encourage the pursuit of better suited 
alternative biofuels in the future. Disadvantages of ethanol as a bio- 
fuel include: (1) high solubility of ethanol in water, which necessitates 
an energy-intensive distillation step; (2) lower energy content per 
gallon of ethanol compared to petroleum; (3) difficulties in the 
distribution of ethanol via today’s pipeline infrastructure owing to its 
hygroscopic nature; and (4) incompatibilities with many current 
vehicles at higher blending volumes. Thus, work is under way to 
enable the synthesis of next-generation biofuel products, such as 
higher-chain alcohols, alkanes, and other molecules with structures 
and activities closer to those of petroleum and diesel. Butanol, for 
example, can be used in conventional engines either in pure form 
or mixed with petroleum, can be transported by existing pipelines, 
and has a higher energy content than ethanol. A number of 
organisms, such as Clostridium acetobutylicum, produce butanol 
from sugars via the ABE (acetone, butanol, ethanol) fermentation 
and can be manipulated to produce mostly butanol. E. coli has 
recently been engineered to produce isobutanol and other alcohols 
via a non-fermentative pathway that may be more readily adapted to 
large-scale production, using heterologous expression of Lactococcus 
lactis and Bacillus subtilis genes. The genomics-enabled construc- 
tion of partial or fully artificial biological systems through ‘synthetic 
biology’ approaches will be key in developing efficient, inexpensive 
biofuel production systems. 


Summary 


This perspective has focused on biofuels derived from lignocellulosic 
feedstocks; however, another potential source of transportation fuels 
is biodiesel, derived from oil-producing plants and microbes. 
Predominant sources of biodiesel presently include soya bean, rape- 
seed and palm oils. The genomes of soya bean and oil palm are both 
currently being sequenced (Table 1), information from which will be 
called on to increase oil production and facilitate plant growth in a 
broader range of habitats. Oil-producing algae are also under active 
study as an alternative source for biodiesel production. One of the 
major advantages of algae is their growth in liquid, negating the issue 
of potential competition between food and energy crops for land use. 
Chlamydomonas reinhardtii is one of a series of algal species whose 
genomes are becoming available for analysis. Considering the 
magnitude of the transportation fuel problem, biodiesel is likely to 
contribute at least in part to the solution, and genomic research on oil 
producers will aid in making biodiesel production cheaper and more 
efficient. 

Placing a man on the Moon and sequencing the human genome 
represented large-scale technologically challenging projects. In both 
these instances, substantial research and development efforts resulted 
in technological advances unforeseen at the projects’ initiation. The 
strategies that were pioneered in sequencing and interpreting the 
human genome for the improvement of human health are now 
poised to be an important contributing technology in the challenge 
to develop environmentally and socially acceptable alternatives to 
fossil fuels. 


UCP2 mediates ghrelin's action on NPY/ 
AgRP neurons by lowering free radicals 


Zane B. Andrews, Zhong-Wu Liu, Nicholas Wallingford, Derek M. Erion, Erzsebet Borok, 
Jeffery M. Friedman, Matthias H. Tschöp, Marya Shanabrough, Gary Cline, Gerald I. Shulman, Anna Coppola, 
Xiao-Bing Gao, Tamas L. Horvath & Sabrina Diano 


The gut-derived hormone ghrelin exerts its effect on the brain by regulating neuronal activity. Ghrelin-induced feeding 
behaviour is controlled by arcuate nucleus neurons that co-express neuropeptide Y and agouti-related protein (NPY/AgRP 
neurons). However, the intracellular mechanisms triggered by ghrelin to alter NPY/AgRP neuronal activity are poorly 
understood. Here we show that ghrelin initiates robust changes in hypothalamic mitochondrial respiration in mice that are 
dependent on uncoupling protein 2 (UCP2). Activation of this mitochondrial mechanism is critical for ghrelin-induced 
mitochondrial proliferation and electric activation of NPY/AgRP neurons, for ghrelin-triggered synaptic plasticity of 
pro-opiomelanocortin-expressing neurons, and for ghrelin-induced food intake. The UCP2-dependent action of ghrelin on 
NPY/AgRP neurons is driven by a hypothalamic fatty acid oxidation pathway involving AMPK, CPT1 and free radicals that are 
scavenged by UCP2. These results reveal a signalling modality connecting mitochondria-mediated effects of 
G-protein-coupled receptors on neuronal function and associated behaviour. 


Ghrelin alters neuronal activity in various brain regions including the 
hypothalamus, hippocampus and midbrain. Ghrelin activates 
NPY/AgRP neurons of the arcuate nucleus to suppress 
pro-opiomelanocortin (POMC) neuronal activity while promoting food 
intake. However, the intracellular mechanisms that mediate 
NPY/AgRP neuronal activation in response to ghrelin remain unknown. 
Uncoupling proteins are inner-membrane mitochondrial proteins 
that are important for bioenergetics of diverse tissues. UCP2 is 
strongly expressed in the hypothalamus including NPY/AgRP neurons 
of the arcuate nucleus where ghrelin receptors (growth hormone 
secretagogue receptor 1; GHSR) are also expressed. Ghrelin 
enhances Ucp2 mRNA expression in the pancreas, liver and white 
adipose tissue, and hypothalamic Ucp2 mRNA levels and protein 
activity are elevated during fasting, when circulating ghrelin is 
increased. Thus, we analysed whether UCP2 is involved in 
hypothalamic neuronal responses to ghrelin and associated food intake. 


Ghrelin induces mitochondrial respiration via UCP2 
Immunohistochemistry revealed the presence of GHSR in greater 
than 90% of UCP2-expressing, β-galactosidase-positive arcuate nucleus 
neurons in Ucp2-/- (ref. 16) mice (Supplementary Fig. 1a-c). 
To analyse whether ghrelin affects mitochondrial respiration, we 
studied isolated hypothalamic synaptosomes in ghrelin- and 
saline-treated animals. Mitochondrial respiration (given as nmol of oxygen 
consumed per minute per mg protein) was significantly increased 
after adding pyruvate and malate (state 2; saline 32.07 ± 1.38 and 
ghrelin 44.83 ± 3.15, P < 0.05, wild-type mice). The subsequent 
addition of ADP (state 3) increased respiration in the ghrelin-treated 
mice relative to saline controls (saline 99.39 ± 3.51 and ghrelin 
155.98 ± 5.36, P < 0.05, wild-type mice). Ghrelin increased basal 
uncoupling status after ATP synthase inhibition with oligomycin 
(saline 27.2 ± 2.2 and ghrelin 37.3 ± 1.6, P < 0.05, wild-type mice; 
Fig. 1a) and free-fatty-acid (palmitate)-induced respiration (saline 
150.0 ± 5.7 and ghrelin 170.1 ± 4.7, P < 0.05, wild-type mice; 
Fig. 1b). Finally, total respiratory capacity induced by 
carbonylcyanide-4-(trifluoromethoxy)-phenylhydrazone (FCCP) was also 
increased by ghrelin (saline 297.0 ± 7.7 and ghrelin 427.9 ± 19.2, 
P < 0.05, wild-type mice; Fig. 1c). These increases were dependent 
on UCP2, as no significant elevation was seen for the above 
respiration parameters in Ucp2-/- mice compared to their saline controls 
(Fig. 1a-c). Genipin, a compound reported to be a specific UCP2 
inhibitor, inhibited oxygen consumption in all states of 
mitochondrial respiration in all tissues studied from both wild-type and 
Ucp2-/- mice (Supplementary Table 1). 
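
To make the size of these effects easier to compare, the short Python sketch below computes the percentage increase with ghrelin for each respiration measurement in wild-type mice, using only the mean values quoted in the paragraph above. The dictionary keys are labels chosen for illustration, and the calculation ignores the reported s.e.m., so the output is descriptive only.

```python
# Mean oxygen consumption in wild-type synaptosomes
# (nmol O2 per minute per mg protein), copied from the text above.
# Each entry is (saline mean, ghrelin mean).
RESPIRATION = {
    "state 2 (pyruvate/malate)": (32.07, 44.83),
    "state 3 (ADP)": (99.39, 155.98),
    "oligomycin": (27.2, 37.3),
    "palmitate": (150.0, 170.1),
    "FCCP": (297.0, 427.9),
}


def percent_increase(saline: float, ghrelin: float) -> float:
    """Percentage change of the ghrelin mean relative to the saline mean."""
    return (ghrelin - saline) / saline * 100.0


def summarize(data: dict) -> dict:
    """Map each condition label to its percentage increase with ghrelin."""
    return {label: percent_increase(s, g) for label, (s, g) in data.items()}


if __name__ == "__main__":
    for label, pct in summarize(RESPIRATION).items():
        print(f"{label}: {pct:.0f}% higher with ghrelin")
```

On these means the largest relative increase is in ADP-stimulated (state 3) respiration, at roughly 57%, and the smallest is in palmitate-induced respiration, at roughly 13%.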

In assessing whether ghrelin’s effect on UCP2 activation involves 
Ucp2 gene expression, we found that ghrelin produced a robust 
increase in hypothalamic Ucp2 mRNA in wild-type mice, but not 
in Ghsr-/- mice (Supplementary Fig. 1d). Ghrelin also stimulated 
Npy gene expression in the arcuate nucleus in a UCP2- (Fig. 1e) and 
GHSR-dependent (Supplementary Fig. 1e) manner. Finally, 
ghrelin-induced AgRP expression also depended on UCP2 (Supplementary 
Fig. 1f). Ghrelin had no effect on POMC (Supplementary Fig. 1g). 


Ghrelin induces UCP2-mediated mitochondrial proliferation 

During fasting the number of mitochondria in NPY/AgRP perikarya 
increases in a UCP2-dependent manner. Ghrelin, which is elevated 
during fasting, also increased mitochondrial number in NPY 
neuronal perikarya in wild-type but not Ucp2-/- mice (Fig. 1f and 
Supplementary Fig. 1h, i). The number of mitochondria in 
POMC-GFP perikarya was unaffected (Supplementary Fig. 1j). 
Because we observed a ghrelin-induced, UCP2- and GHSR-dependent 
increase in expression of nuclear respiratory factor 1 (Nrf1; Fig. 1g), 
a transcription factor that stimulates mitochondrial biogenesis, we 
propose that the change in mitochondrial number in NPY/AgRP 
neurons may also involve de novo mitochondrial biogenesis. This is 
supported by the observation that peroxisome proliferator-activated 
receptor-γ coactivator 1α (PGC1A) enhances mitochondrial 
biogenesis and respiration in association with robust induction of UCP2 
and NRF (ref. 8). 

The above results on perikaryal mitochondria number and from 
counts of synaptosomal mitochondria from hypothalamic tissue of 
wild-type or Ucp2-/- mice treated with ghrelin or saline (see 
Supplementary Fig. 2a-g), together with ghrelin’s effect on state 3 
respiration and total respiration capacity of synaptosomal 
mitochondria, suggest that elevated oxygen consumption by 
mitochondria in response to ghrelin arises in part from increased 
mitochondrial number. 

Figure 1 | Mitochondrial and transcriptional effects of ghrelin are mediated 
by UCP2. a-c, Ghrelin alters hypothalamic mitochondrial respiration. 
a, Oligomycin inhibits ATP synthase and uncouples substrate oxidation 
from ATP phosphorylation. The resulting increase in uncoupled oxygen 
consumption in ghrelin-treated mice is due to UCP2, as Ucp2-/- mice 
exhibit no response to ghrelin. b, Ghrelin enhances respiration after 
palmitate addition in wild-type but not Ucp2-/- mice. c, An increase in 
respiration after FCCP shows that ghrelin enhances the maximal respiratory 
capacity in wild-type but not in Ucp2-/- mice. d, Ghrelin (10 nmol) 
increases hypothalamic Ucp2 mRNA expression as measured by real-time 
PCR (n = 6). The data are expressed as a fold increase relative to saline levels. 
e, Arcuate Npy gene expression is increased in response to ghrelin in 
wild-type but not Ucp2-/- mice (n = 6). f, Ghrelin promotes mitochondrial 
proliferation in a UCP2-dependent manner in arcuate NPY/AgRP cells. 
g, Ghrelin increases Nrf1 mRNA expression in the hypothalamus of 
wild-type but not Ucp2-/- mice. h, Representative traces showing that ghrelin 
decreases the mitochondrial membrane potential after addition of energy 
substrates (state 2) and after addition of oligomycin (state 4). Asterisk 
indicates statistically significant differences (P < 0.05) with respect to saline 
controls. All error bars indicate s.e.m. Olig, oligomycin; Palm, palmitate; 
Pyr/Mal, pyruvate and malate. 


Ghrelin alters mitochondrial membrane potential via UCP2 

In peripheral tissues UCP2 alters mitochondrial membrane 
potential. Thus, we analysed the mitochondrial membrane potential 
in synaptosomes of ghrelin- or saline-treated wild-type and Ucp2-/- 
mice. Ghrelin-treated wild-type synaptosomes exhibited lower 
mitochondrial membrane potential after addition of pyruvate and malate 
(state 2: 78.43 ± 8.61% of saline values; P < 0.05) and oligomycin 
(state 4: 75.13 ± 4.87% of saline values; P < 0.05) compared to 
saline-treated wild-type synaptosomes (Fig. 1h). These differences were 
absent in Ucp2-/- mice (Supplementary Fig. 3a). These observations 
suggest that a component of the altered oxygen consumption 
induced by ghrelin may also arise from increased uncoupled 
respiration; however, further studies are needed to strengthen these findings 
and study the basal differences in mitochondrial membrane potential 
kinetics in wild-type and Ucp2-/- mice. 


Ghrelin-induced NPY/AgRP activation is dependent on UCP2 


As a surrogate of neuronal activation, c-fos expression was examined 
in response to ghrelin in NPY neurons using mice expressing 
NPY-GFP on the Ucp2-/- background (NPY-GFP/Ucp2-/- mice). After 
ghrelin injection, total NPY/c-fos dual labelling was significantly 
attenuated in NPY-GFP/Ucp2-/- mice compared to NPY-GFP/ 
wild-type mice (wild type 602.33 ± 98.50, Ucp2-/- 298.40 ± 64.59, 
P < 0.05; Fig. 2a and Supplementary Fig. 3b-d). Neither total 
NPY cell number nor arcuate nucleus volume was affected by 
genotype or ghrelin treatment (Supplementary Fig. 3e, f). No 
difference was observed in POMC-GFP/Ucp2-/- mice (Supplementary 
Fig. 3g-i). 

Next, patch-clamp whole-cell electrophysiological recordings in 
slice preparations from NPY-GFP/wild-type and NPY-GFP/ 
Ucp2-/- mice showed that ghrelin increased action potential firing 
in all NPY neurons; however, this increase was severely attenuated in 
Ucp2-/- mice compared to wild-type animals (Fig. 2b and Supplementary 
Fig. 4a). This defect was specific to the action of ghrelin on NPY 
neurons, as glutamate produced normal responses in wild-type and 
Ucp2-/- mice (Supplementary Fig. 4b). Ghrelin had no effect on 
POMC-GFP neuronal action potential frequency. These results are 
concordant with anatomical observations showing that 94% of NPY 
and just 8% of POMC neurons express GHSRs. 


Ghrelin-induced synaptic plasticity is mediated by UCP2 


Ghrelin indirectly hyperpolarizes POMC neurons by activating 
inhibitory NPY/AgRP inputs, and, consistent with this, 
ghrelin-induced recruitment of miniature inhibitory postsynaptic currents 
(mIPSCs) on POMC cells was significantly elevated in wild-type 
animals (153.8 ± 22.4%, n = 5), but not in Ucp2-/- mice 
(100.5 ± 2.5%, n = 5; Fig. 2c and Supplementary Fig. 4c). 

Changes in mIPSCs are associated with synaptic plasticity. 
Corresponding to the mIPSC measurements, ghrelin treatment 
increased symmetric, putatively inhibitory synapses on POMC 
neurons in wild-type (saline 8.33 ± 0.97 and ghrelin 16.55 ± 1.68) but 
not in Ucp2-/- mice (saline 7.23 ± 1.63 and ghrelin 9.12 ± 1.63; 
Fig. 2d). Conversely, ghrelin significantly reduced the number of 
asymmetric, putatively excitatory synapses on POMC neurons in 
wild-type mice (saline 15.75 ± 1.89 and ghrelin 8.66 ± 1.05) but 
not in Ucp2-/- mice (Supplementary Fig. 4d). Although the 
inhibitory input of POMC neurons arises partially from arcuate nucleus 
NPY/AgRP neurons, the origin of the stimulatory synapses needs 
further exploration. 


UCP2 mediates ghrelin-induced food intake 


To determine whether the UCP2-dependent cellular responses 
described above are relevant for behaviour, we analysed the feeding 
behaviour of wild-type and Ucp2-/- mice in response to ghrelin 
administration (intraperitoneal, 10 nmol) at the beginning of the 
dark phase (Fig. 2e). At 1, 2 and 3 h after injection, ghrelin-induced 
hyperphagia was severely attenuated in Ucp2-/- mice compared to 
wild-type mice (1 h: wild type 4.45 ± 0.24, Ucp2-/- 2.23 ± 0.19, 
P < 0.01; 2 h: wild type 4.41 ± 0.36, Ucp2-/- 1.72 ± 0.03, 
P < 0.001; Fig. 2e, units are fold intake relative to saline controls). 
We and others recently demonstrated that ghrelin can act at multiple 
sites in the brain to induce food intake. Thus, we injected ghrelin 
(500 pmol per µl) locally into the mediobasal hypothalamus or ventral 
tegmental area (VTA) (Supplementary Fig. 4e, f) immediately before 
the dark phase. At all time points, intra-mediobasal hypothalamus 
ghrelin-induced food intake was significantly lower in Ucp2-/- mice 
compared to wild type (Fig. 2f). Intra-VTA ghrelin-induced food 
intake was also attenuated in Ucp2-/- mice compared to wild type at 
1 h but not at 2 or 3 h after ghrelin injection (Supplementary Fig. 4g). 
These data suggest that UCP2 regulates ghrelin-induced feeding 
behaviour in both the mediobasal hypothalamus and VTA. The 
longer lasting effect of ghrelin in the mediobasal hypothalamus 
suggests that mediobasal hypothalamus control dominates 
ghrelin-induced feeding behaviour and is consistent with the finding that 
mice with conditional knockout of arcuate nucleus NPY/AgRP 
neurons in adulthood do not respond to ghrelin. 

Figure 2 | Effects of ghrelin on neuronal firing, synaptic plasticity and 
feeding are mediated by UCP2. a, UCP2 potentiates NPY c-fos activation in 
the arcuate nucleus in response to ghrelin (10 nmol). Stereological analysis 
of total NPY-GFP/c-fos cells in the arcuate nucleus of wild-type (n = 6) and 
Ucp2-/- mice (n = 6) is shown. b, Ghrelin increases mean action potential 
frequency of NPY-GFP neuronal perikarya in whole-cell recordings from 
wild-type mice (n = 7); this effect is diminished in Ucp2-/- mice (n = 7). 
c, Ghrelin increases the frequency of mIPSCs on POMC-GFP perikarya in 
wild type but not in Ucp2-/- mice (n = 5 for both groups). d, Ghrelin 
increases the number of symmetric, putatively inhibitory perikaryal 
synapses on POMC neurons in wild-type mice but not in Ucp2-/- mice. 
e, Peripheral ghrelin administration (intraperitoneal) significantly 
(P < 0.05) enhances food intake relative to saline controls in both wild-type 
(n = 6) and Ucp2-/- (n = 6) mice at 1, 2 and 3 h after injection. However, 
food intake in Ucp2-/- mice is significantly attenuated compared to 
wild-type mice 1, 2 and 3 h after injection. f, Mediobasal hypothalamus ghrelin 
injection (500 pmol per µl) significantly increases food intake of both 
genotypes at 1, 2 and 3 h after injection. However, Ucp2-/- mice display 
severely attenuated food intake at all time points compared to wild-type 
controls. Asterisk indicates statistically significant differences (P < 0.05) 
with respect to saline controls; dagger indicates statistically significant 
(P < 0.05) reduction with respect to ghrelin-treated wild-type mice. All error 
bars indicate s.e.m. 

We also tested leptin-induced reduction of food intake in Ucp2-/- 
mice compared to wild-type controls. No significant differences were 
observed between the two groups (Supplementary Fig. 5a, b). 


Ghrelin induces AMPK and ACC phosphorylation 


Ghrelin increases AMPK activity within the hypothalamus, including 
NPY/AgRP neurons, which promotes feeding and enhances 
mitochondrial biogenesis and function in the periphery. We 
found that ghrelin promotes AMPK phosphorylation (pAMPK/ 
AMPK ratio) in a GHSR-dependent manner (Fig. 3a) in both 
wild-type and Ucp2-/- mice (Fig. 3b), indicating that ghrelin’s action on 
AMPK is downstream of GHSR but upstream of UCP2. We also 
observed that ghrelin elevates phosphorylated acetyl-CoA 
carboxylase (ACC), consistent with increased AMPK activity (Fig. 3b). 
Intracerebroventricular injection of a stimulator of AMPK activity, 
5-aminoimidazole-4-carboxamide-1-β-D-ribofuranoside (AICAR), 
like ghrelin, increased both mitochondrial coupled and uncoupled 
respiration in the hypothalamus of wild-type but not Ucp2-/- mice 
(Supplementary Fig. 6a). In addition, AICAR promoted food intake 
in both wild-type and Ghsr-/- mice in a similar manner (Fig. 3c), 
whereas AICAR-induced food intake was significantly diminished in 
Ucp2-/- mice compared to wild type (Fig. 3d). 
Intracerebroventricular injection of compound C, an inhibitor of 
AMPK, inhibited food intake induced after peripheral ghrelin 
injections in wild-type mice, but did not further decrease food intake in 
Ucp2-/- animals (Fig. 3e); it also prevented ghrelin-induced 
elevations in Ucp2 mRNA levels in wild-type mice (Fig. 3f). These 
results confirm that ghrelin activates AMPK signalling through the 
GHSR, and reveal that AMPK requires UCP2 to exert an effect on 
feeding. 
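
The ordering argument in this section can be restated as a toy linear-pathway model. The Python sketch below is an illustrative simplification, not the authors' analysis: it assumes an all-or-nothing chain (GHSR, AMPK, UCP2, feeding) in which a stimulus enters at one node and a knockout removes a node, whereas the effects reported in Ucp2-/- mice were attenuations rather than complete blocks. All function and variable names are hypothetical.

```python
# Toy epistasis model: a signal propagates down a linear chain unless a
# node between its entry point and the readout has been removed.
PATHWAY = ["GHSR", "AMPK", "UCP2", "feeding"]

# Where each stimulus enters the chain (assumption based on the text:
# ghrelin acts at its receptor, AICAR stimulates AMPK directly).
ENTRY_POINT = {"ghrelin": "GHSR", "AICAR": "AMPK"}


def reaches(stimulus: str, readout: str, removed: frozenset = frozenset()) -> bool:
    """Return True if the stimulus can activate the readout node."""
    start = PATHWAY.index(ENTRY_POINT[stimulus])
    end = PATHWAY.index(readout)
    if end < start:
        return False  # readout lies upstream of the entry point
    # Every node from the entry point to the readout must be intact.
    return not any(node in removed for node in PATHWAY[start:end + 1])


if __name__ == "__main__":
    ko_ghsr = frozenset({"GHSR"})
    ko_ucp2 = frozenset({"UCP2"})
    print("ghrelin -> AMPK, Ghsr knockout:", reaches("ghrelin", "AMPK", ko_ghsr))
    print("ghrelin -> AMPK, Ucp2 knockout:", reaches("ghrelin", "AMPK", ko_ucp2))
    print("AICAR -> feeding, Ghsr knockout:", reaches("AICAR", "feeding", ko_ghsr))
    print("AICAR -> feeding, Ucp2 knockout:", reaches("AICAR", "feeding", ko_ucp2))
```

Under these assumptions the model reproduces the qualitative pattern described above: removing GHSR blocks ghrelin's effect on AMPK but not AICAR-driven feeding, while removing UCP2 leaves AMPK activation intact but blocks feeding.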


Ghrelin-induced food intake is mediated by CPT1 


AMPK activation and ACC inhibition lead to a reduction in 
malonyl-CoA and release of carnitine palmitoyltransferase 1 (CPT1) from 
allosteric inhibition. Increased CPT1 activity enhances oxidation of 
long chain fatty acids (LCFA) and increases food intake. Ghrelin 
increased the expression of hypothalamic Cpt1 mRNA (Fig. 3g) and 
this was dependent on AMPK activity, as intracerebroventricularly 
administered compound C blocked this effect (Fig. 3g). The 
induction of Cpt1 mRNA by ghrelin occurred rapidly, after just 1 h 
(Supplementary Fig. 6b). Ghrelin-triggered elevation in Ucp2, Npy 
and Nrf1 mRNA levels also occurred after 1 h (Supplementary 
Fig. 6c). The ghrelin-induced increase in Ucp2 mRNA levels was 
diminished by concomitant intracerebroventricular administration 
of either compound C or etomoxir (CPT1 inhibitor) (Fig. 3f), 
confirming that ghrelin-induced Ucp2 mRNA expression is mediated by 
the AMPK-CPT1 pathway. Furthermore, etomoxir and compound 
C prevented the ghrelin-induced increase in both Npy and Agrp gene 
expression (Fig. 3h and Supplementary Fig. 6d), mimicking the 
impairments found in Ucp2-/- mice (see Fig. 1e and 
Supplementary Fig. 1f). No difference in Pomc mRNA was observed 
in any group (data not shown). In line with the mRNA expression 
data, etomoxir also suppressed ghrelin-induced food intake (Fig. 3i). 

The aforementioned data indicate that CPT1 activation, similar to 
AMPK activation, is upstream of UCP2. However, we found that 
ghrelin-induced elevation of hypothalamic Cpt1 mRNA levels is 
not observable in Ucp2-/- animals (Fig. 3j). This apparent paradox 
is resolved in studies described below regarding hypothalamic free 
radicals. 


Ghrelin raises hypothalamic LCFA CoAs and blood NEFAs 


To assess whether ghrelin increases the availability of substrates for
mitochondrial beta oxidation coinciding with AMPK and CPT1
activation, we analysed hypothalamic levels of LCFA CoAs 3 h after
ghrelin administration. Total LCFA CoAs (comprising both 16- and
18-carbon LCFA CoAs) were significantly higher in the hypothalamus
of ghrelin- versus saline-treated controls (saline 29.15 ± 1.17,
ghrelin 33.72 ± 1.15 nmol metabolite per g tissue; P < 0.05; n = 4
and 6 respectively; Supplementary Fig. 6e). Next, we analysed
LCFA CoAs in the hypothalamus at 0, 30, 60, 120 and 180 min after
ghrelin administration. Compared to time 0 (36.18 ± 2.25 nmol
metabolite per g tissue), by 30 min (in the absence of food)
hypothalamic LCFA CoA levels were significantly elevated (40.75 ±
1.55 nmol metabolite per g tissue, P < 0.05) and remained high
through 60 (43.08 ± 0.95 nmol metabolite per g tissue), 120
(42.24 ± 0.62 nmol metabolite per g tissue) and 180 min
(40.9 ± 0.42 nmol metabolite per g tissue; Supplementary Fig. 6f).
Although no statistical differences were detected between these values,
the trend suggests that from 60 min onwards, hypothalamic levels of
LCFA CoAs may start to descend (Supplementary Fig. 6f), perhaps
due to increased use and oxidation of LCFA CoAs. Supporting this
idea is the paradigm in which chronically elevated ghrelin (20 h of
fasting) resulted in depletion of hypothalamic levels of LCFA CoAs.
We also found that non-esterified fatty acid (NEFA) levels in the
circulation (0.23 ± 0.3 mequiv. l−1 at time 0) start to rise 30 min after
ghrelin injection (0.30 ± 0.3 mequiv. l−1, P > 0.05), and become
significantly higher at 60 (0.42 ± 0.3 mequiv. l−1, P < 0.01), 120
(0.51 ± 0.07 mequiv. l−1) and 1,080 min (1.09 ± 0.11 mequiv. l−1,
P < 0.00001) (Supplementary Fig. 6g). The elevated NEFA levels during
negative energy balance, accompanied by an initial rise but subsequent
decline in hypothalamic LCFA CoA levels, are consistent with
the conclusions of ref. 31 suggesting that the source of hypothalamic
LCFA CoAs is the periphery, and also with our observations that
hypothalamic LCFA CoAs may undergo beta oxidation.
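
As a rough check on the size of these effects, the relative changes can be
computed directly from the group means quoted above. The Python sketch
below is illustrative only: the function and variable names are
assumptions introduced here, and it uses the reported means while ignoring
the error terms.

```python
def percent_change(baseline: float, value: float) -> float:
    """Return the change from baseline to value as a percentage of baseline."""
    return (value - baseline) / baseline * 100.0

# Group means quoted in the text (nmol metabolite per g tissue).
lcfa_saline, lcfa_ghrelin = 29.15, 33.72

# Time course: minutes after ghrelin -> mean hypothalamic LCFA CoA level.
lcfa_time_course = {0: 36.18, 30: 40.75, 60: 43.08, 120: 42.24, 180: 40.9}

print(f"ghrelin vs saline: {percent_change(lcfa_saline, lcfa_ghrelin):+.1f}%")
for minutes, level in lcfa_time_course.items():
    change = percent_change(lcfa_time_course[0], level)
    print(f"{minutes:>3} min: {change:+.1f}% relative to time 0")
```

On these means the peak falls at 60 min, with lower values at 120 and
180 min, which matches the non-significant downward trend noted in the text.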


Ghrelin reduces ROS in NPY/AgRP neurons via UCP2 


A by-product of increased mitochondrial fatty acid beta oxidation is
increased ROS production. Ghrelin significantly increased
synaptosomal ROS in Ucp2−/− but not in wild-type hypothalami
compared to saline controls (Fig. 4a). Next, we injected ghrelin or saline
together with dihydroethidium bromide (DHE), a compound that
fluoresces as it is oxidized, to analyse in situ ROS levels in NPY-GFP
and POMC-GFP mice on wild-type and Ucp2−/− backgrounds.
There was no difference in the amount of fluorescent DHE in NPY
neurons of wild-type animals treated with either ghrelin or saline
(Fig. 4b and Supplementary Fig. 7). However, ghrelin treatment of
Ucp2−/− animals resulted in increased levels of DHE in NPY neurons
compared to saline controls (Fig. 4b and Supplementary Fig. 7). The
analysis of randomly selected, non-GFP-positive cells showed a shift
in DHE levels similar to that seen in NPY neurons (quantified data
not shown). In contrast, however, DHE levels in POMC-GFP
neurons were significantly higher in wild-type mice treated with


Figure 3 | Ghrelin activates an intracellular pathway of fatty acid
metabolism. a, Ghrelin increases the pAMPK/AMPK ratio in wild-type but
not Ghsr−/− mice (n = 5). Ghr, ghrelin; Sal, saline. b, Ghrelin increases the
pAMPK/AMPK ratio and phosphorylates downstream ACC in both wild-type
and Ucp2−/− mice, indicating that activation of AMPK is upstream to
increased UCP2 activity (n = 6). c-e, AMPK regulation affects food intake.
c, AICAR significantly increases food intake relative to saline controls in
wild-type and Ghsr−/− mice. d, AICAR-induced food intake is significantly
suppressed in Ucp2−/− mice compared to controls, indicating that UCP2 is
required to translate AMPK activity into a complete food intake response
(n = 5). e, AICAR significantly increases cumulative 3-h food intake relative
to saline controls; however, this effect is completely blocked with a
concurrent intracerebroventricular injection of compound C (CC,
6 µg µl−1), an inhibitor of AMPK activity. AICAR did not affect the
cumulative 3-h food intake in Ucp2−/− mice and this was not affected by
concurrent intracerebroventricular injection of compound C.
f, Intraperitoneal ghrelin administration increases hypothalamic Ucp2
mRNA; however, this increase is blocked by concurrent injection of
etomoxir or CC, indicating that activation of the AMPK-CPT1 pathway is
important in the regulation of Ucp2 mRNA expression. g, Intraperitoneal
ghrelin administration increases hypothalamic Cpt1 mRNA expression;
however, the induction of Cpt1 mRNA with ghrelin (Ghr) is blocked when
the CPT1 inhibitor etomoxir (Eto), or compound C (CC), is injected
intracerebroventricularly. h, Peripheral ghrelin injection increases
hypothalamic expression of Npy mRNA, which is completely suppressed by
intracerebroventricular injection of etomoxir and CC. Note that in
f-h mRNA expression was not affected by intracerebroventricular injection
of etomoxir or compound C with saline injected intraperitoneally.
i, Inhibition of CPT1 activity with an intracerebroventricular injection of
etomoxir suppresses ghrelin-induced (IP 10 nmol) food intake compared to
mice receiving intraperitoneal ghrelin and intracerebroventricular saline
(Ghr/Sal). Neither intracerebroventricular etomoxir nor compound C
affected food intake in mice treated intraperitoneally with saline (n = 5-7).
j, Ghrelin increased Cpt1 mRNA expression in wild-type but not in Ucp2−/−
mice (n = 6, P < 0.05). Asterisk, statistically significant (P < 0.05) with
respect to saline controls; dagger, statistically significant (P < 0.05) with
respect to Ghr/Sal; double dagger, statistically significant (P < 0.05) with
respect to Ghr/Eto. All error bars indicate s.e.m.
saline compared to those given ghrelin (Fig. 4b and Supplementary
Fig. 7). In Ucp2−/− mice, lower levels of DHE were observed in
POMC neurons in both the saline- and ghrelin-treated mice
(Fig. 4b and Supplementary Fig. 7). Overall these data are consistent
with UCP2's putative ability to buffer ROS levels.


ROS scavenging reverses responses of Ucp2−/− mice

In light of the data on ROS, we hypothesized that the paradoxical lack
of Cpt1 mRNA induction by ghrelin in Ucp2−/− animals (Fig. 3j) may
be due to an effect of ROS on CPT1. Thus, we injected a ROS
scavenging cocktail containing ascorbic acid (10 µM), N-acetyl-L-cysteine
(10 µM) and TEMPO (1 µM) intracerebroventricularly into wild-type
or Ucp2−/− mice with a concurrent intraperitoneal injection of ghrelin
Figure 4 | ROS is a critical regulator of cellular and behavioural responses
to ghrelin. a, Ghrelin increases hypothalamic synaptosomal ROS
production in Ucp2−/− mice but not wild-type mice (n = 4-5).
b, Quantification of ROS production shows that ghrelin markedly increases
in situ ROS in NPY neurons in Ucp2−/− but not wild-type mice.
c, Suppressed Cpt1 mRNA expression in ghrelin-treated Ucp2−/− mice after
3 h is completely reversed and elevated above wild-type levels by concurrent
intracerebroventricular injection of free radical scavenging (FRS) cocktail
(n = 5, P < 0.05). i.c.v., intracerebroventricular; i.p., intraperitoneal. d, FRS
cocktail reverses suppressed Npy mRNA in ghrelin-treated Ucp2−/− mice.
e, Incubating arcuate slices with FRS restores suppressed NPY neuronal
action potential firing in Ucp2−/− mice. Neurons were from NPY-GFP/
Ucp2−/− mice. Asterisk, significant to 100% saline controls. f, The FRS
cocktail reverses the suppressed ghrelin-induced food intake after 1 h in
Ucp2−/− mice (n = 9, P < 0.05). Except for e: asterisk, significant to saline
controls; dagger, significant decrease compared to wild-type mice
administered ghrelin intraperitoneally and saline intracerebroventricularly;
double dagger, significant with respect to Ucp2−/− mice administered
ghrelin intraperitoneally and saline intracerebroventricularly. All error bars
indicate s.e.m.
(10 nmol). Scavenging of ROS enabled ghrelin to induce elevated Cpt1
mRNA levels in Ucp2−/− mice (Fig. 4c).

Our results support the hypothesis that ghrelin-triggered ROS
production promotes UCP2 activity and mRNA expression and that
the main action of UCP2 in these circuits is to promote ROS
scavenging. In line with this, we found that after intracerebroventricular
injection of ROS scavenging cocktail, ghrelin-induced elevations in
hypothalamic Ucp2 mRNA levels in wild-type mice were diminished
(Supplementary Fig. 8a), which was paralleled by rescue of induction
of Npy and Agrp mRNA levels by ghrelin in Ucp2−/− mice (Fig. 4d
and Supplementary Fig. 8b). In addition, incubating isolated
hypothalamic synaptosomes from ghrelin-treated wild-type mice
with the ROS scavenging cocktail suppressed ghrelin-induced
increases in mitochondrial uncoupled respiration after oligomycin
treatment but not in state 3 or FCCP-driven respiration
(Supplementary Fig. 8c). When hypothalamic slices from Ucp2−/−
mice were incubated (for 20 min) with the ROS scavenging cocktail,
ghrelin-evoked NPY neuronal firing was also restored to wild-type
values (Figs 2b and 4e). Moreover, in the presence of the ROS
scavenger, washout of ghrelin did not silence NPY neurons (Fig. 4e),
indicating that the intracellular events triggered by ghrelin not only
potentiated but also maintained neuronal firing subsequent to the
initial ghrelin signalling. In association with the effect of ROS
scavenger on cellular events in the hypothalamus of Ucp2−/− mice,
ghrelin's ability to promote feeding was restored as well in Ucp2−/− mice
pre-treated with the ROS scavenger (Fig. 4f).


Discussion 


We have identified intracellular events that contribute to appropriate
activation of NPY/AgRP neurons in response to ghrelin
(Supplementary Fig. 9). Ghrelin induces a rapid increase of NPY/AgRP
neuronal firing by activation of GHSR. Ghrelin, via GHSR, also
results in AMPK activation that suppresses ACC activity, eliminating
the inhibition of CPT1. CPT1 activation can enhance LCFA entry
into mitochondria and increase beta oxidation. Fatty acid beta
oxidation promotes generation of ROS, which together with fatty acids
promote UCP2 transcription and activity. UCP2 activity
neutralizes ROS, thereby allowing continuous CPT1-promoted fatty acid
beta oxidation that enables continuous support of the bioenergetic
needs to maintain firing of NPY/AgRP cells. Sustained firing of NPY/AgRP
neurons, the efferents of which are GABAergic onto POMC
neurons, results in activity-dependent synaptic plasticity promoting
an organization of increased inhibitory input onto POMC neurons
leading to increased feeding. Our results also indicate that fatty acids,
which are elevated in the circulation concurrently with ghrelin during
fasting, are acutely increased in the hypothalamus in response to
ghrelin administration so that they can serve as a continuous fuel
supply for NPY/AgRP neurons during negative energy balance. In
light of previous work, we predict that the absence of fatty acid beta
oxidation such as during increased carbohydrate load or positive
energy balance may affect UCP2 activation and lead to a different
cellular outcome. In support of this, it was recently reported that
elevated glucose levels in animals on a high fat diet promote UCP2
induction, which was paralleled with impaired ability of POMC
neurons to sense glucose. The apparent elevated baseline ROS levels in
POMC neurons are consistent with the possibility of increased
carbohydrate fuel-based activity in these cells at the time of relative
satiety (when UCP2 would not be activated), because carbohydrates,
and glucose in particular, are major generators of ROS through
increasing electron transport at complex I. Because NPY neurons
show lower ROS production than POMC neurons, it is tempting
to speculate that the NPY/AgRP neurons are less susceptible to
free-radical-induced damage, leading to resistance of orexigenic
hypothalamic mechanisms to cellular stress. Conversely, POMC cells
may be more vulnerable to free-radical-induced damage and
progressive degeneration, which may have relevance to age-related onset
of obesity. Because GHSR and UCP2 expression overlaps in other
brain sites, including the midbrain, we predict that ghrelin
modulation of other brain functions will also be regulated by
UCP2-modulated fatty acid metabolism.


METHODS SUMMARY 


Detailed methodology can be found in the Supplementary Information. In short, 
tissue preparation, immunocytochemistry, synaptosomal respiration, real-time
PCR, quantitative and qualitative electron microscopic analyses of mitochondria
and synapse number, electrophysiology and feeding behaviour analyses were
performed as previously described.

The mitochondrial membrane potential was measured in isolated synaptosomal
mitochondrial fractions from the hypothalamus using ΔΨm-sensitive
TMRE and fluorescent spectrofluorophotometry. Non-esterified fatty acids were 
measured in plasma using a colorimetric non-esterified fatty acid (NEFA) kit 
(Wako) following the manufacturer’s instructions. LCFA CoAs were measured 
by mass spectroscopy. 

ROS production was quantified using dichlorodihydrofluorescein diacetate 
(DCF) in mitochondrial fractions and using DHE in fixed tissue. 
Structural basis for translation
termination on the 70S ribosome 


Martin Laurberg, Haruichi Asahara, Andrei Korostelev, Jianyu Zhu, Sergei Trakhanov & Harry F. Noller


At termination of protein synthesis, type I release factors promote hydrolysis of the peptidyl-transfer RNA linkage in
response to recognition of a stop codon. Here we describe the crystal structure of the Thermus thermophilus 70S ribosome in
complex with the release factor RF1, tRNA and a messenger RNA containing a UAA stop codon, at 3.2 Å resolution. The stop
codon is recognized in a pocket formed by conserved elements of RF1, including its PxT recognition motif, and 16S ribosomal
RNA. The codon and the 30S subunit A site undergo an induced fit that results in stabilization of a conformation of RF1 that
promotes its interaction with the peptidyl transferase centre. Unexpectedly, the main-chain amide group of Gln 230 in the
universally conserved GGQ motif of the factor is positioned to contribute directly to peptidyl-tRNA hydrolysis.


Protein synthesis is carried out by the ribosome, where genetic 
information encoded in mRNA is translated into a polypeptide chain. 
Translation ends when a stop codon on mRNA reaches the decoding 
centre in the A site of the small ribosomal subunit. In contrast to the 
recognition of sense codons by tRNAs, stop codons are recognized by 
extra-ribosomal proteins called class I release factors, which also pro- 
mote hydrolysis of the peptidyl-tRNA linkage in the peptidyl transferase
centre of the large ribosomal subunit. In bacteria, the UAG and
UAA stop codons are recognized by release factor RF1, and the UGA 
and UAA codons by RF2 (refs 4-6). The low error frequency (10⁻³ to
10⁻⁶) of stop codon recognition is accomplished without a
proofreading mechanism. In principle, stop codons could be recognized
directly by specific molecular interactions with the release factors, 
indirectly by release-factor-promoted interactions with the ribosome, 
or by some combination of the two. Similarly, hydrolysis of the pep- 
tidyl-tRNA linkage could be catalysed by the factors themselves, or by 
conversion of the ribosomal peptidyl transferase to an esterase by 
interactions with the factors. Low-resolution cryo-electron microscopy
and X-ray studies of termination complexes have revealed
that the release factors bind to the ribosome in an orientation that 
roughly coincides with that of a tRNA bound to the A site, placing 
their extremities in the decoding centre of the small subunit and in the 
peptidyl transferase centre in the large subunit. 
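
The codon specificities stated above (RF1 reads UAG and UAA, RF2 reads
UGA and UAA) amount to a small lookup table. The Python sketch below
is a minimal illustration of that mapping only; the names
RELEASE_FACTOR_CODONS and factors_for_codon are assumptions introduced
here and say nothing about the recognition mechanism itself.

```python
from typing import List

# Bacterial stop-codon specificity as stated in the text.
RELEASE_FACTOR_CODONS = {
    "RF1": {"UAG", "UAA"},
    "RF2": {"UGA", "UAA"},
}

def factors_for_codon(codon: str) -> List[str]:
    """Return the class I release factors that read the given codon."""
    codon = codon.upper().replace("T", "U")  # accept DNA-style input too
    return sorted(
        factor
        for factor, codons in RELEASE_FACTOR_CODONS.items()
        if codon in codons
    )

for codon in ("UAA", "UAG", "UGA", "AUG"):
    print(codon, factors_for_codon(codon))
```

UAA is the only codon shared by both factors, and a sense codon such as
AUG maps to neither.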

On the basis of mutational studies, a direct recognition model
involving a 'tripeptide anticodon' has been proposed for stop-codon
recognition. According to this model, the conserved motifs PxT
(in RF1) or SPF (in RF2) decipher the corresponding stop codons.
X-ray structures of the release factors bound to the ribosome at 5.9 Å
and 6.7 Å resolution showed that these tripeptide motifs are indeed
positioned close to the stop codon, but left the mechanism of
recognition unresolved. Although structural changes involving the
universally conserved nucleotides guanosine (G)530, adenosine (A)1492
and A1493 of 16S rRNA have been shown to have a critical role in
discrimination of cognate tRNAs, it is unclear whether these nucleotides
play an active part in translation termination.

Mutational analysis and molecular simulation studies have implicated
the universally conserved GGQ motif in hydrolysis of the
peptidyl-tRNA ester linkage. Although its glutamine side chain has
been suggested to be involved directly in catalysis, mutation of
this residue has generally led to only small effects, whereas
mutation of the glycines confers up to 10⁴-fold slower rates of peptide
release. Proposed roles for the GGQ motif include orienting a
water molecule for nucleophilic attack, passively opening a path
for access of a water molecule to the peptidyl transferase centre
(PTC) and excluding other nucleophiles from the esterase reaction.
Finally, the mechanism by which peptidyl-tRNA hydrolysis
is coupled to stop-codon recognition is not understood. Thus, the
molecular bases of all three of the principal mechanisms of translation
termination remain unresolved.

Here we present the 3.2 Å crystal structure of a translation termination
complex containing the T. thermophilus 70S ribosome, a
defined mRNA, an initiator tRNAfMet bound to an AUG codon in the
P site, non-cognate tRNAfMet in the E site and release factor RF1
bound in response to a UAA stop codon in the A site. The structure 
reveals that during termination the proline and threonine side chains 
of the PxT motif both contact the stop codon bases, accompanied by 
conformational changes in the decoding centre. However, discrim- 
ination of the stop codon also involves a network of interactions 
between the mRNA, 16S rRNA, and backbone and side-chain atoms 
of RF1 outside the PxT motif. Although Gln 230 of the conserved 
GGQ motif interacts directly with the PTC, its main-chain amide 
group, rather than its side chain, is positioned to play a critical part in 
catalysis. Our structure suggests that stop-codon recognition is 
coupled to rearrangement of a loop between domains 3 and 4 of 
RF1, which results in positioning of the GGQ motif in the PTC. 


Recognition of the stop codon 

The overall shape and orientation of the release factor in the 70S 
ribosome (Fig. 1) is similar to that reported in low-resolution 
X-ray and cryo-electron microscopy studies. Domains 2, 3 and
4 of the factor occupy the ribosomal A site, superimposing approxi- 
mately on the position of tRNA bound to the A site (Fig. 1a, b). The 
conformations of the decoding centre of the ribosome and of the stop 
codon in the termination complex differ markedly from those in 
sense-codon-recognition complexes (Supplementary Fig. 2b).
In the sense-codon complex, A1492 and A1493 of 16S rRNA both 
flip out from their ground-state positions in helix 44 (h44) to form 
A-minor interactions with the minor groove of the codon-anticodon
helix (Supplementary Fig. 2b). In the termination complex, only
A1492 flips out, leaving A1493 stacked within h44 (Fig. 2a and 
Supplementary Fig. 2d). A1913, at the tip of the loop of h69 of the 
23S rRNA, occupies the space vacated by A1492 in h44 and stacks on 
A1493 (Fig. 2a and Supplementary Fig. 2d). This conformational 
change seems to have a critical role in the termination mechanism, 
as explained later. 

The UAA stop codon is bound in a pocket formed by conserved
elements of 16S rRNA and domain 2 of RF1, which contains the
proposed PxT tripeptide anticodon motif (Fig. 2 and
Supplementary Figs 3 and 4). Packing of the tip of helix α5 of RF1
at Gly 116 against the Watson-Crick edge of uridine (U)1 of the stop
codon (Fig. 2b) discriminates against a purine at this position. A
hydrogen bond between U1 and the backbone of Gly 116 is possible
only with U in this position, as are hydrogen bonds from the Glu 119
backbone amide and the hydroxyl of Thr 186 in the PxT motif to the
O4 position of U1. These extensive interactions explain why
discrimination is strongest for the first base of the stop codon. In the second
position, A2 of the stop codon is stacked between U1 on one side and
the imidazole of His 193 on the other. The Watson-Crick edge of A2
packs against the side-chains of Pro 184 and Glu 119; its 6-amino
(N6) group hydrogen bonds with the hydroxyl of Thr 186 (Fig. 2b
and Supplementary Fig. 4a). Discrimination against guanosine (G) at
the second position is probably explained by the inability of its O6 to
hydrogen bond with Thr 186 (which donates its only hydrogen to
U1). It is less obvious why pyrimidines are excluded here; possible
explanations are a weaker propensity for stacking with U1 and the
energy penalty caused by a loss of packing against the hydrophobic
moieties of Pro 184 and Glu 119. Thus, the PxT motif interacts with
the first two bases of the stop codon, rather than with the second and

Figure 1 | Structure of RF1 in the termination complex. a, Positions of RF1
(yellow), P-site tRNA (orange), E-site tRNA (red) and mRNA (green) in the
70S ribosome. b, An elongation complex showing the position of A-site
tRNA (yellow) in the context of the current structure. c, Orientation of RF1,
showing the P-site tRNA (orange), the peptidyl transferase centre (PTC), the
third bases, as proposed previously; however, this motif is by no
means the sole determinant of codon recognition.

A3 is unstacked from the first two bases of the stop codon, diverted 
by the RF1 main chain at residues 192-193 into a position that differs 
markedly from that of the third base in a sense codon (Fig. 2c),
stacking instead on G530. Its N6 amino group donates hydrogen 
bonds to the side chains of Gln 181 and Thr 194, and its N7 accepts 
a hydrogen bond from the side chain of Thr 194 (Fig. 2c), permitting 
recognition of both A and G by RF1, while discriminating against
pyrimidines. In the backbone of the stop codon, the 2’-hydroxyls of 
U1 and A2 interact with phosphate 1493 and ribose 1492, respect- 
ively, while the 2'-hydroxyl of A3 hydrogen bonds with the main- 
chain carbonyl of Ile 192 (Fig. 2b, c). Surprisingly, substituting 
deoxyriboses in the backbone of the stop codon confers only marginal
defects in peptide release and RF1 binding, suggesting that
these interactions are not critical for stop-codon recognition. 
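
The base-by-base discrimination described in this section can be summarized
as a per-position filter: U only at position 1, A only at position 2, and
either purine at position 3. The Python sketch below is a deliberately
simplified, Boolean reading of those contacts (real discrimination is
energetic and graded, not all-or-nothing); the names RF1_ALLOWED_BASES and
rf1_accepts are assumptions introduced here for illustration.

```python
from itertools import product

# Bases tolerated at codon positions 1, 2 and 3, as a simplified reading of
# the contacts described in the text (hypothetical names, Boolean model).
RF1_ALLOWED_BASES = ("U", "A", "AG")

def rf1_accepts(codon: str) -> bool:
    """Return True if every base of the codon passes the per-position filter."""
    codon = codon.upper().replace("T", "U")
    if len(codon) != 3:
        return False
    return all(
        base in allowed for base, allowed in zip(codon, RF1_ALLOWED_BASES)
    )

# Screen all 64 RNA codons and keep those that pass the filter.
all_codons = ["".join(bases) for bases in product("ACGU", repeat=3)]
accepted = [codon for codon in all_codons if rf1_accepts(codon)]
print(accepted)
```

Passing all 64 codons through this filter leaves only UAA and UAG, the two
stop codons that the text assigns to RF1.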

Our structure explains the recently reported observation of competitive
binding between RF1 and paromomycin to the ribosome.
When paromomycin is bound, A1493 flips out of h44 (ref. 14), steri- 
cally preventing the binding of RF1 (Supplementary Fig. 2c); con- 
versely, in the RF1 complex, A1493 occupies the binding site for 
paromomycin in h44. RF1 function thus involves participation of 
nucleotides G530, A1492 and A1493, although in an entirely different 
way from that seen in sense-codon recognition. Superposition of
the recognition domain of RF2 on that of RF1, on the basis of the 
similarities in folding of domain 2, and in particular the specificity 
loops for the structures of the free forms of both type I release factors 
and of RF1 in the termination complex, allows us to extend our
observations to suggest how RF2 may recognize its UAA and UGA 
codons (see Supplementary Information and Supplementary Fig. 6). 


decoding centre (DC) and helices h43 and h95 of 23S rRNA. d, The structure
of RF1 in its ribosome-bound conformation, rotated ~180° from a and
c. GGQ and PVT motifs are shown in red, and the switch loop (see text) in
orange. The domains of RF1 (1-4) are indicated.
Peptidyl transferase centre

The GGQ motif at positions 228-230, universally conserved in the
type I release factors of all organisms, has been implicated in their
ability to promote peptidyl-tRNA hydrolysis. Biochemical studies
have suggested that the role of the factors in the hydrolysis reaction is
to convert the ribosomal peptidyl transferase to an esterase in a
reaction using water as an acceptor for the peptidyl moiety.
Our structure shows that the GGQ motif indeed resides in the PTC
(Fig. 3a), in agreement with previous low-resolution X-ray findings,
although the Cα-backbone tracing of this region of RF1 as modelled
at 5.9 Å resolution is incompatible with the 3.2 Å electron density
map. Residues 228-235 rearrange from their free form to form a
short helical element (Fig. 3a, b), which probably contributes to the
rigidity of placement of the critical residues. At its end, the GGQ
motif interacts with the PTC and with A76 of the P-site tRNA
(Fig. 3 and Supplementary Fig. 7). The main-chain amide of
Gln 230 is found within hydrogen bonding distance of the 3'-OH of
A76 of the deacylated tRNA bound in the P site (Fig. 3b), whereas its
side-chain amide is positioned away from the location of the scissile
ester bond, in a pocket formed by A2451, C2452, U2506 and ribose 76
(Fig. 3b and Supplementary Fig. 7), consistent with its proposed role
in discrimination of the attacking nucleophile.

The conformations of nucleotides U2506, G2583, U2584, U2585
and A2602 in the PTC of 23S rRNA are consistent with their
sensitivity to the occupancies of the A and P sites (see
Supplementary Information and Supplementary Fig. 8f-h).
Mutational analysis has shown A2602 to be critical for
release-factor-dependent termination. Contrary to proposals concerning its
direct involvement in catalysis of the hydrolysis reaction, A2602
is buried in a cavity in RF1 and blocked from the catalytic site by the
path of the main chain of RF1 at Gly 228 (Fig. 4 and Supplementary
Fig. 7c). The observed defects in peptide release by A2602 mutant
ribosomes must therefore be caused by the contribution of A2602
to the binding and/or positioning of domain 3 of RF1 for catalysis.
Proposed mechanisms for the function of RF1 in peptide release
include coordination or activation of the attacking water molecule
and induction of conformational changes in the PTC. In our
structure, the main-chain amide of Gln 230 is positioned to form a
hydrogen bond with the 3'-hydroxyl group of the terminal nucleoside
A76 of the deacylated P-site tRNA. Interpretation at this level of
structural detail is facilitated by the location of Gln 230 at the end of a
helical element, which, supported by the well-resolved electron
density for its side chain, places strong constraints on both the position
and the orientation of its backbone amide nitrogen (Supplementary
Fig. 7b). Thus, the Gln 230 main-chain amide may help to favour
ester hydrolysis by coordinating the leaving group (Fig. 3b, c),
analogous to a mechanism that has been proposed for hydrolysis of GTP
by Ras. Moreover, superposition of the structure of a transition-state
analogue bound to the 50S subunit with our structure places
the oxyanion of the transition-state tetrahedral intermediate at
hydrogen-bonding distance from the backbone amide nitrogen of
Gln 230 (Fig. 3d), suggesting the possibility of transition-state
stabilization, and thus potentially contributing to enhancement of the
catalytic rate (Fig. 3d, e). When the structure of an oligonucleotide
analogue of deacylated tRNA bound at the A site is superimposed
on that of the termination complex, the 3'-hydroxyl of its A76 is also


Figure 2 | Interactions with the UAA stop codon in the decoding centre of
the RF1 termination complex. a, Stereo view of σA-weighted 3Fobs−2Fcalc
electron density map of the stop codon and surrounding elements of the
ribosome and RF1. The density for RF1 was contoured at 1.0σ, and for the
mRNA and rRNA at 1.5σ. b, Recognition of U1 and A2 of the UAA stop
codon. c, Discrimination of the third base (A3) by RF1, showing its stacking
on G530 of 16S rRNA. 16S rRNA is shown in cyan, RF1 in yellow, mRNA in
green and 23S rRNA (A1913) in grey.


©2008 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 454|14 August 2008 


found close to the position of the main-chain amide of Gln 230, 
although it is positioned less optimally for hydrogen bonding with 
the 3'-hydroxyl of the P-site A76 or with the transition-state ana- 
logue. This could explain why binding of a deacylated tRNA to the A 
site can catalyse peptide release, albeit at lower rates than the class I
release factors. Involvement of the main-chain amide group
explains why mutation of Gln 230 has only modest effects on catalysis.
Conversely, the absence of a side chain in Gly 229 allows
the main-chain amide of Gln 230 to approach the 3’-hydroxyl group 
of A76 of the P-site tRNA and, similarly, the transition-state inter- 
mediate, in keeping with the severe defects in peptide release con- 
ferred by mutations in Gly 229 of RF1 (refs 18, 21). 


Communication between the decoding site and the PTC 

The danger of premature termination is of critical concern in release
factor function. Prevention of hydrolysis of the peptidyl-tRNA
linkage before recognition of an authentic stop codon requires strict
coordination of these two events. This is probably accomplished by
preventing the final docking of RF1 domain 3 in the PTC until codon
recognition is established. In the nearly identical crystal structures of
isolated RF1 and RF2 (refs 26, 37), the orientation of domains 2 and 3
differs markedly from that observed in their respective termination
complexes, indicating a propensity for flexibility in their relative
positioning. This is supported by direct observation of both the
closed and open conformations of free release factors in solution.
Comparison of the ribosome-bound RF1 structure with those of free
RF1 or RF2 suggests that a key event involves conformational
rearrangement of a loop (residues 286-300) connecting domains 3
and 4, which we term the 'switch loop' (Figs 1d and 5). The result is
both to relax its tether on domain 3, allowing it to reach the PTC, and
to extend α7 by two helical turns that are formed from loop residues
286-293 (Fig. 5a). On rearrangement, the switch loop is stabilized by
interactions within a pocket formed between the ribosome and RF1
(Fig. 5b). The extra turns of helix formed at the end of α7 (Fig. 5a) are


Figure 3 | Interactions of the GGQ region of RF1 in the PTC. a, Stereo view
of σA-weighted 3Fobs−2Fcalc electron density for RF1 (yellow), P-site tRNA
(orange) and 23S rRNA (grey) contoured at 1.7σ. b, Position of Gln 230.
c, Model for product stabilization by hydrogen bonding between the main-chain
amide of Gln 230 and the 3'-OH of A76 of the P-site tRNA.



d, Superposition of a peptidyl-transferase transition-state analogue (TSA, 
orange) complexed with the 50S subunit (grey) on the structure of the 
termination complex (this work). The main-chain amide of Gln 230 is 
positioned to hydrogen bond with the oxyanion of the TSA. e, Model for 
transition-state stabilization. 
Figure 4 | Stereo view of the RF1 binding pocket for 23S rRNA nucleotide 


held in position by contacts with the β-sheet of domain 2 of RF1 and 
with residues 1914-1915 in the loop of h69 of 23S rRNA (Fig. 5), 
directing domain 3 into the PTC. 

Rearrangement of the switch loop and positioning of the extended 
α7 helix depend on interactions that are made possible by successful 
stop codon recognition. Flipping out of A1493 or failure of A1913 to 
stack on A1493 in h44 not only would result in steric clash between 
these nucleotides and domain 2 of RF1, preventing correct docking of 
its ‘reading head’ on the stop codon, but would also affect formation 
of the binding pocket for the rearranged switch loop and contacts 
needed to position the extended α7 helix. Thus, binding of RF1, 
recognition of the stop codon and the ensuing molecular rearrange- 
ments in both the ribosome and the factor leading to docking of the 
GGQ motif in the PTC are strongly interdependent, cooperative 
events. The critical interactions involving nucleotides 1913-1915 of 
23S rRNA explain the deleterious effects on translation termination 
caused by deletion of helix 69 of 23S rRNA. 


The mechanism of translation termination 


The structure presented here supports the following model for the 
mechanism of translation termination. The stop codon is bound and 
recognized in a pocket formed by interactions between the β-sheet of 
domain 2 of RF1 (including the conserved PxT motif), which is the 
reading head of RF1, and G530, A1492 and A1493, the three critical 
nucleotides of the 16S rRNA decoding site. A1492 flips out from its 
position in h44, reminiscent of the events of tRNA selection. 
However, in contrast to the mechanism of sense-codon recognition, 
A1493 remains stacked within h44, allowing docking of the reading 
head of RF1 on the stop codon, with which it would otherwise clash. 
The space vacated by A1492 in h44 is filled by A1913 of 23S rRNA, 
which stacks on A1493, removing itself and A1493 from a possible 
clash with RF1. Interactions between RF1 and these rearranged ele- 
ments of 16S and 23S rRNA induce a conformational change in the 
switch loop of RF1, which forms an extension of the α7 helix, creating 


Figure 5 | Rearrangements and packing of the switch loop of RF1. 

a, Superposition of domain 2 of free RF2 (ref. 37; pink) on that of ribosome- 
bound RF1 (yellow). Rearrangement of the switch loop (red in free RF2 and 
orange in bound RF1) results in reorientation of α7 and extension of its 
length by two helical turns. Free RF2 is shown because of local disorder in the 




NATURE|Vol 454|14 August 2008 


A2602. 23S rRNA is shown in grey, P-site tRNA in orange and RF1 in yellow. 


a rigid connection between the decoding site and domain 3, docking 
the latter in the peptidyl transferase centre. Facilitated by product and 
possibly transition-state stabilization by the main-chain amide group 
of Gln 230 of RF1, the peptidyl-tRNA linkage is then cleaved. The 
position of the side chain of the universally conserved Gln 230 is 
consistent with proposals that it may help to orient the attacking 
water molecule and to discriminate water from other potential 
nucleophiles, although this would need to be confirmed by the 
structure of a substrate complex rather than the product complex 
presented here. An interesting finding is that in the mechanisms of 
both stop-codon recognition and catalysis, elements of the polypep- 
tide backbone of RF1 seem to have a crucial role. Thus, the basic 
functions of translation termination could have originated from 
interactions with simple peptides, possibly as an evolutionary trans- 
ition from a process that may have initially been carried out by RNA, 
a possibility supported by the observation that deacylated tRNA 
bound to the A site is able to catalyse peptide release. 


METHODS SUMMARY 


Ribosomes were purified from T. thermophilus HB27, and were then dissociated 
and reassociated to remove endogenous mRNA and tRNAs as described in 
Supplementary Methods. A 70S termination complex was formed with RF1, a 
defined mRNA and tRNAs, and was crystallized essentially as described in ref. 15. 
The crystal structure was solved by molecular replacement and was refined in 
CNS and PHENIX, as described. 
Compositional differences between meteorites and near-Earth asteroids 
Understanding the nature and origin of the asteroid population in 
Earth’s vicinity (near-Earth asteroids, and its subset of potentially 
hazardous asteroids) is a matter of both scientific interest and 
practical importance. It is generally expected that the compositions 
of the asteroids that are most likely to hit Earth should reflect 
those of the most common meteorites. Here we report that most 
near-Earth asteroids (including the potentially hazardous subset) 
have spectral properties quantitatively similar to the class of 
meteorites known as LL chondrites. The prominent Flora family 
in the inner part of the asteroid belt shares the same spectral 
properties, suggesting that it is a dominant source of near-Earth 
asteroids. The observed similarity of near-Earth asteroids to LL 
chondrites is, however, surprising, as this meteorite class is rela- 
tively rare (~8 per cent of all meteorite falls). One possible 
explanation is the role of a size-dependent process, such as the 
Yarkovsky effect, in transporting material from the main belt. 

Spectroscopic observations of more than 400 near-Earth asteroids 
(NEAs) in visible wavelengths show that 65% of NEAs have S- and 
Q-type spectral properties. When corrected for discovery biases, the 
near-Earth population of S- and Q-type asteroids is estimated to be 
36% of the total NEA population. Given the dominance of S- and 
Q-type NEAs and the dominance of ordinary chondrites in meteorite 
falls (~80%), we focus our analysis on the possible correlations 
between these two major groups. 

To explore the mineralogical composition of potentially hazardous 
asteroids (PHAs) and the larger population of NEAs, we used the SpeX 
instrument on the NASA Infrared Telescope Facility (IRTF) on Mauna 
Kea, Hawaii. This instrument is capable of measuring the flux of 
reflected sunlight from asteroids over the near-infrared range 
(0.8-2.5 μm), which can reveal diagnostic absorption bands due to olivine 
and pyroxene at 1 μm and pyroxene at 2 μm (Fig. 1). These absorption 
features are characteristic for both ordinary chondrites and most NEAs. 

Through a joint observational campaign working cooperatively 
between MIT, the University of Hawaii and the IRTF 
(http://smass.mit.edu/minus.html), we have collected near-infrared spectra of 38 
S- and Q-type NEAs, of which 12 fall into the subset of PHAs, defined 
as the objects whose trajectories pass within 0.05 AU of the orbit of 
the Earth. For each of these, the visible wavelength portion of their 
spectrum was also available from previously published studies. We 
performed a detailed comparison of these telescopically measured 
asteroid spectra with analogous wavelength laboratory measure- 
ments of ordinary chondrite meteorites catalogued in the RELAB 
database (http://www.planetary.brown.edu/relab/). Using RELAB 
measurements for 57 ordinary chondrites, Fig. 1 presents a direct 
comparison between meteorite and asteroid reflectance spectral 
properties, including their observational uncertainties. 


We applied a radiative transfer model to analyse and compare the 
mineralogies for both asteroid and meteorite samples. We first 
applied this model to the ordinary chondrite meteorite spectra to 
(1) constrain the composition of ordinary chondrites relative to pet- 
rographic measurements; and (2) define a robust tracer that would 
allow us to perform a direct comparison between these meteorites 
and their possible asteroid parent bodies. We used various end-mem- 
ber minerals found in ordinary chondrites (olivine, orthopyroxene 
and minor components such as clinopyroxene, chromite and iron). 
For olivine, orthopyroxene and clinopyroxene, we considered different 
chemistries, that is, different Mg numbers. In agreement with 
petrographic measurements, we found that olivine and orthopyrox- 
ene are the two main minerals in ordinary chondrites and that clin- 
opyroxene is a minor component (<6%). The inclusion of chromite 
and iron neither improved our fits nor changed the abundance of 
olivine, orthopyroxene and clinopyroxene by more than 3%. Thus, 
we used olivine (ol), orthopyroxene (opx) and clinopyroxene only, 
and defined as a tracer the ratio of inferred abundances for the two 
main minerals, ol/(ol+opx). This ratio varies by up to ~40% among the 
studied ordinary chondrites. For the three ordinary chondrite subgroups 
(H, L, LL) we obtain the following average ol/(ol+opx) ratios: 
H, 58.8 ± 4.5%; L, 64.2 ± 6.8%; and LL, 75.1 ± 4.5%. Each of our 
model-fit average ratios is within 3% of its corresponding value 
derived from direct laboratory analysis. We find an average 
ol/(ol+opx) ratio of 63% for all ordinary chondrites in our sample. 

We applied the same radiative transfer model to our NEA sample 
(as well as to members of the Flora family, discussed below). To 
account for the spectral reddening (if present) due to space weathering 
processes, we used a space weathering model. We found similar 
ol/(ol+opx) ratios for both S- and Q-type NEAs, thus 
demonstrating that space weathering processes do not influence their 
interpreted mineralogies (see Supplementary Information). We 
obtain an average ol/(ol+opx) value of 69.6% for our entire S- and 
Q-type NEA sample. The distributions of inferred ol/(ol+opx) values 
for the studied ordinary chondrites and NEAs are shown in Fig. 2. We 
weighted each ordinary chondrite subclass (H ~43%, L ~47% 
and LL ~10%) by its relative fall statistics among all ordinary 
chondrites. 
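
As a rough consistency check, the subgroup averages and approximate fall fractions quoted above can be combined into a single fall-weighted mean. The sketch below is illustrative only: the function name and the dictionary layout are assumptions, and the inputs are the rounded values given in the text, not the underlying 57-meteorite data set.

```python
# Mean ol/(ol+opx) ratios (%) quoted in the text for each subgroup.
mean_ratio = {"H": 58.8, "L": 64.2, "LL": 75.1}
# Approximate share of each subgroup among ordinary chondrite falls.
fall_fraction = {"H": 0.43, "L": 0.47, "LL": 0.10}


def fall_weighted_mean(ratios, weights):
    """Weighted mean of per-class ratios; weights are renormalised to sum to 1."""
    total = sum(weights.values())
    return sum(ratios[k] * weights[k] for k in ratios) / total


weighted = fall_weighted_mean(mean_ratio, fall_fraction)
print(f"fall-weighted ol/(ol+opx): {weighted:.1f}%")
```

With these rounded inputs the weighted mean comes out close to 63%, below the 69.6% average obtained for the NEA sample.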

We find that in both their intrinsic spectral properties (Fig. 1) and 
in our mineralogical analysis (Fig. 2), our NEA sample does not have 
the same distribution as the overall population of ordinary chondrite 
meteorites. On average, NEAs appear more olivine-rich than ordinary 
chondrites. The best match occurs between the NEAs and the LL 
ordinary chondrites: most NEAs (~2/3) are best matched by the LL 
chondrites. We define 
compatibility between an NEA and an LL chondrite as occurring 
Figure 1 | Spectral comparisons of asteroids and meteorites. 

a, Comparison between the visible to near-infrared spectral signatures of the 
most common NEAs and ordinary chondrite meteorites. Both populations 
have similar spectral characteristics, displaying broad absorptions (band 1 
and band 2) due to the presence of olivine and pyroxene. For NEAs, these 
bands are usually superimposed on a spectral continuum whose overall 
reflectance increases with wavelength. Asteroids showing this reddish 
continuum are classified as ‘S-types’ while those that match ordinary 
chondrites (neutral slope) are called ‘Q-types’. Space weathering 
processes similar to those acting on the Moon can redden and darken the 
Q-type spectrum of a fresh asteroid surface, giving it the appearance of an 
S-type spectrum. The majority (~2/3) of our NEA sample show a 
band 1 centre at wavelengths >1.02 μm, displayed as an average spectrum 
before (NEA average) and after (NEA de-reddened) applying a space 
weathering correction. The average spectra for the H, L and LL ordinary 
chondrite sub-classes (57 ordinary chondrite spectra in total) and the model 
fit of the average NEA spectrum (ol/(ol+opx) ~ 76%) are displayed. The 
majority of NEAs are spectrally similar to, and have modelled mineralogies 
within the range of, LL chondrite meteorites. b, A quantitative comparison 
and error analysis for the spectral properties for our NEA sample (38 objects; 
circles) and our meteorite sample (57 objects; H, L, LL symbols). PHAs are 
inscribed with ‘+’. Band 1 centre is defined as the wavelength that bisects 
that band into two equal areas. The band area ratio is band 2/band 1, 
calculated using the modified Gaussian method. (Differences in 
conventions for fitting the continuum do not allow these ratio values to be 
directly compared between techniques.) Typical 1σ observational error bars 
(s.d. of the mean) for the asteroids are exemplified by PHA 1862 Apollo 
(coordinates 0.25, 1.05), based on multiple independent observations. 
(Meteorite measurements are consistently repeatable in the laboratory and 
have uncertainties small compared to their plotted symbols.) Spectral band 
centres and areas are also subject to uncertainties arising from variations in 
grain size and space weathering; the points at lower left depict modelled 
uncertainties arising from extreme ranges of these effects. Within the span of 
all uncertainties, ~2/3 of sampled S- and Q-type NEAs (and equivalently for 
the PHAs) can be seen to overlap the spectral parameter space of LL 
chondrite meteorites. 
when the ol/(ol+opx) ratio for the former falls into the LL compositional 
range (70-85%). This result is surprising, because LL chondrites 
are the least abundant ordinary chondrites (they represent only 
10% of all ordinary chondrites, and 8% of all meteorites). It may 
suggest that the NEAs we sample telescopically (with radii of 0.3 km 
to 10 km) are not the immediate parent bodies of smaller objects that 
fall to Earth as meteorites (that is, pre-atmospheric meteorite parent 
bodies having radii on the order of metres). Our interpreted minera- 
logical difference between 0.3-10 km NEAs and ordinary chondrite 
meteorites suggests that different dynamical mechanisms and/or 
main-belt source regions may be responsible for supplying these 
two sample populations (see Supplementary Discussion). 

Dynamical simulations predict that ~40% of all roughly kilometre-sized 
NEAs should be delivered from the asteroid belt to 
Earth-crossing orbits by the ν6 resonance located on the innermost 
edge of the main belt. Interestingly, the S-type Flora family, which 
accounts for 15-20% of the asteroids residing in the inner main-belt 
region between 2.1 and 2.5 AU, borders the ν6 resonance and is 
expected to contribute substantially to the population of NEAs. 
Using the same mineralogical modelling technique to analyse spectra 
of the asteroid Flora and its fellow family members, we find they have 
ol/(ol+opx) values in the 76-82% range. This relatively high olivine 
abundance of the Flora family agrees with previous measurements 
and is in excellent agreement with the distribution of 
ol/(ol+opx) ratios for our NEA sample. 
Figure 2 | Model results for the ratio of olivine to olivine+orthopyroxene 
for 57 ordinary chondrites and 38 S- and Q-type NEAs. a, Ordinary 
chondrites; b, S- and Q-type NEAs. We also depict the inferred 
compositional ranges for the Flora family and PHAs (8 of 12 observed PHAs 
have ol/(ol+opx) ratios in the range 73-79%; the 4 others are displayed with 
‘+’ symbols; error bars are 1σ). About 95% (±4%) of H and L ordinary 
chondrites are on the left of the dashed line at ol/(ol+opx) = 70% and ~95% 
(±4%) of LL ordinary chondrites are on the right. The x position of the H, L 
and LL labels indicates their average ol/(ol+opx) value. We define 
compatibility between an NEA and an LL chondrite when the ol/(ol+opx) 
ratio for an NEA falls into the LL compositional range (70-85%). It appears 
that most ordinary chondrites and large S- and Q-type NEAs do not have 
compatible mineralogy. Most NEAs (~63%) are compatible with an LL 
mineralogy (ol/(ol+opx) in the 70-85% range), whereas LL chondrites 
account for only 10% of the ordinary chondrite falls. 
Our results have two direct implications. First, our observed compositional 
link of LL chondrites with the Flora family appears inconsistent 
with Flora as the source for shocked L chondrites (~25% of 
all falls), assuming Flora is a relatively homogeneous parent body. 
Second, the particular predominance of PHAs (as a representative 
sample of all NEAs) falling into the LL chondrite category would 
suggest that a majority of terrestrial craters may have been produced 
by LL chondrite impactors. At present there are only sparse data 
showing a mix of L and LL terrestrial impactors, but this type 
of analysis could serve as a geological test of our results. 

The present observational results also raise, but do not resolve, two 
questions. First, why do approximately 2/3 of all NEAs appear to have 
compositions most similar to a class of meteorites (LL chondrites) 
that comprises only 8% of all meteorite falls? And second, why does 
the Flora family (which we suggest dominates the kilometre-sized 
bodies) not also dominate the flux at the smaller (metre-scale) sizes 
that produce the greatest number of meteorites? Size range may be 
the key difference between observed NEAs and the bodies that fre- 
quently deliver meteorites (see Supplementary Notes). 

Our PHA and NEA sample comprises approximately kilometre- 
sized bodies, whereas meteorite precursor bodies are most frequently 
a few metres in size. We note that size is a key factor in the efficiency of 
the Yarkovsky effect, which causes bodies (of diameter 
D ≲ 20 km) to drift in orbital semi-major axis and encounter major 
resonances that speed their delivery to the inner Solar System. At the 
largest sizes (kilometres), the Yarkovsky drift is expected to be limited 
and relatively slow (0.01 AU over ~0.5 Gyr), which implies that the 
source regions of observable PHAs and NEAs most probably border 
on resonances. Thus the Flora asteroid family, located at the belt’s 
inner edge, adjacent to the highly effective ν6 resonance, is particularly 
suited as a predominant source for delivering kilometre-sized 
objects to the vicinity of Earth. 

If kilometre-sized objects are indeed predominantly delivered 
from the inner asteroid belt (and perhaps the Flora family), then 
Yarkovsky drift and the same ν6 resonance will be even more effective 
for delivering an inbound flux of potential meteorite bodies in the 
metre-size range. At such sizes, Yarkovsky drift is predicted to be 
much more substantial and rapid (0.1 AU over ~50 Myr) throughout 
the entire inner asteroid belt. Thus metre-sized bodies throughout a 
much greater breadth of the asteroid belt may be drifted into a num- 
ber of possible resonances for transport to the inner Solar System. If 
this is the case, the compositional distribution of meteorite falls is 
likely to be very different from that of a sample drawn only from the 
inner edge. Thus a heliocentric-distance-dependent and size-dependent 
process such as the Yarkovsky effect may account for 
the apparently different compositional distributions we find between 
the kilometre-sized and metre-sized populations of bodies intersect- 
ing the orbit of Earth. 
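
The size dependence argued above can be made concrete with the two drift figures quoted in the text. The following is a minimal sketch that assumes a constant drift rate (a simplification made only for illustration); the function names are hypothetical, and the 2.1-2.5 AU width is the inner main-belt range mentioned earlier.

```python
def drift_rate(distance_au, duration_myr):
    """Mean drift rate in AU per Myr, assuming a constant drift."""
    return distance_au / duration_myr


# Figures quoted in the text.
KM_SIZED = drift_rate(0.01, 500.0)  # 0.01 AU over ~0.5 Gyr
M_SIZED = drift_rate(0.1, 50.0)     # 0.1 AU over ~50 Myr

# Width of the inner main-belt region quoted in the text (2.1 to 2.5 AU).
INNER_BELT_WIDTH_AU = 2.5 - 2.1


def crossing_time_myr(rate_au_per_myr, width_au=INNER_BELT_WIDTH_AU):
    """Time to drift across the given width at a constant rate (illustrative)."""
    return width_au / rate_au_per_myr


speedup = M_SIZED / KM_SIZED
print(f"metre-sized bodies drift ~{speedup:.0f}x faster")
print(f"km-sized crossing time: ~{crossing_time_myr(KM_SIZED) / 1000:.0f} Gyr")
print(f"m-sized crossing time:  ~{crossing_time_myr(M_SIZED):.0f} Myr")
```

Under this simple assumption, kilometre-sized bodies would need longer than the age of the Solar System to cross the inner belt, so only those already bordering a resonance are delivered, whereas metre-sized bodies can be drawn from the whole region.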
Testing the speed of ‘spooky action at a distance’ 


Daniel Salart, Augustin Baas, Cyril Branciard, Nicolas Gisin & Hugo Zbinden 


Correlations are generally described by one of two mechanisms: 
either a first event influences a second one by sending information 
encoded in bosons or other physical carriers, or the correlated 
events have some common causes in their shared history. 
Quantum physics predicts an entirely different kind of cause for 
some correlations, named entanglement. This reveals itself in cor- 
relations that violate Bell inequalities (implying that they cannot 
be described by common causes) between space-like separated 
events (implying that they cannot be described by classical communication). 
Many Bell tests have been performed, and loopholes 
related to locality and detection have been closed in several 
independent experiments. It is still possible that a first event could 
influence a second, but the speed of this hypothetical influence 
(Einstein’s ‘spooky action at a distance’) would need to be defined 
in some universal privileged reference frame and be greater than 
the speed of light. Here we put stringent experimental bounds on 
the speed of all such hypothetical influences. We performed a Bell 
test over more than 24 hours between two villages separated by 
18 km and approximately east-west oriented, with the source 
located precisely in the middle. We continuously observed two- 
photon interferences well above the Bell inequality threshold. 
Taking advantage of the Earth’s rotation, the configuration of 
our experiment allowed us to determine, for any hypothetically 
privileged frame, a lower bound for the speed of the influence. For 
example, if such a privileged reference frame exists and is such that 
the Earth’s speed in this frame is less than 10⁻³ times that of the 
speed of light, then the speed of the influence would have to exceed 
that of light by at least four orders of magnitude. 

According to quantum theory, quantum correlations violating Bell 
inequalities simply happen, somehow from outside space-time, in 
the sense that there is no space-time explanation for their occurrence: 
there is no event here that somehow influences another distant event 
there. Yet such a description of correlations, which is radically dif- 
ferent from all those found in any other part of science, should be 
thoroughly tested. 

In 1989, Eberhard realized that the existence of a hypothetically 
privileged reference frame can be experimentally tested for. The idea 
is that the speed of this influence, although greater than the speed of 
light, is finite. Hence, if the events are simultaneous in the hypothet- 
ically privileged frame, then the signal does not arrive on time and no 
violation of Bell inequalities should be observed. Note that if the 
events are simultaneous in some reference frame, then they are also 
simultaneous with respect to any reference frame moving in a dir- 
ection perpendicular to the line joining the two events. Accordingly, 
Eberhard proposed (Ph. H. Eberhard, personal communication) to 
perform a Bell test over a long distance oriented east-west, over a 
period of 12 hours (ref. 8). If the events were simultaneous in the 
Earth’s reference frame, then they would also be simultaneous with 
respect to all frames moving in the plane perpendicular to the east-west 
axis, and in 12 hours all possible hypothetically privileged 
frames would be scanned. An intriguing alternative, which we will 
not follow here, is to speculate that there is a tachyonic field that 
couples specifically to entangled particles. 

Bohm’s pilot-wave model of quantum mechanics is an example 
of a theory containing an explicit spooky action at a distance, which 
requires the assumption that there is a universally privileged frame. 
Also, if the spooky action at a distance propagates at finite speed, then 
an experiment like the one presented below could possibly falsify the 
pilot-wave model. Reference 12 stresses that the existence of a universally 
privileged frame would not contradict relativity. 

By 2000, a Bell experiment along the lines presented above had 
already been analysed. However, the analysis concerned only two 
hypothetically privileged reference frames. The first frame was 
defined by the ~2.7 K cosmic microwave background radiation. 
The second frame analysed was the ‘Swiss Alps reference frame’, that 
is, not a universal frame but merely the frame defined by the massive 
environment of the experiment. The assumption that the privileged 
frame depends on the experiment’s environment leads naturally to 
the issue of situations in which the massive environments on both 
sides of the experiment differ, and this was the main subject of the 
experiment in 2000 (refs 13, 14). In both of these analyses, the hypo- 
thetical superluminal influence was termed the speed of quantum 
information to stress that it is not classical signalling. We shall use 
this terminology, but we emphasize that this is only the speed of a 
hypothetical influence and that our result casts very serious doubts 
on its existence. For views on the speed of quantum information, see 
ref. 15. 

Before presenting our experiment and results, let us clarify the 
principle of our measurements and how to obtain bounds on this 
speed of quantum information in any reference frame. 

In an inertial reference frame centred on the Earth, two events A 
and B (in our experiment two single-photon detections) respectively 
occur at positions $\mathbf{r}_A$ and $\mathbf{r}_B$ at times $t_A$ and $t_B$. Let us consider another 
inertial reference frame F, the hypothetically privileged frame, relative 
to which the Earth frame moves with a velocity $\mathbf{v}$ (Fig. 1). When 
correlations violating a Bell inequality are observed, the speed of 
quantum information $V_{QI}$ in frame F that could cause the correlation 
is bounded from below as follows: 

$$V_{QI} \geq \frac{\|\mathbf{r}'_B - \mathbf{r}'_A\|}{|t'_B - t'_A|}$$

where $(\mathbf{r}'_A, t'_A)$ and $(\mathbf{r}'_B, t'_B)$ are respectively the coordinates of 
events A and B in frame F, obtained from $(\mathbf{r}_A, t_A)$ and $(\mathbf{r}_B, t_B)$ by 
Lorentz transformation. After simplification, we obtain 

$$\left(\frac{V_{QI}}{c}\right)^2 \geq 1 + \frac{(1-\beta^2)(1-\rho^2)}{(\rho+\beta_\parallel)^2} \qquad (1)$$

where $\beta = v/c$ is the speed ($v = \|\mathbf{v}\|$) of the Earth frame in frame F 
relative to the speed of light ($c$); $\beta_\parallel = v_\parallel/c$, with $v_\parallel$ the component of $\mathbf{v}$ 
parallel to the A-B axis; and $\rho = c\,t_{AB}/r_{AB}$ quantifies the alignment of 
the two events in the Earth frame, with $t_{AB} = t_B - t_A$ and 
$r_{AB} = \|\mathbf{r}_B - \mathbf{r}_A\|$. In the following, we will consider space-like separated 
events, for which $|\rho| < 1$: the bound on $V_{QI}$ in equation (1) will then be 
larger than $c$. For a given privileged frame F, this bound depends on the 
orientation of the A-B axis through $\beta_\parallel$ and on the alignment $\rho$. To 
obtain a good lower bound for $V_{QI}$, we should bound the term 
$(\rho + \beta_\parallel)^2$ from above by the smallest possible value, during a period 
of time $T$ needed to observe a Bell violation (which, in our experiment, 
will be the integration time of a two-photon interference fringe). 
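
The bound of equation (1) can be checked numerically. The sketch below is not the authors' code: it assumes equation (1) reads (V_QI/c)^2 >= 1 + (1 - beta^2)(1 - rho^2)/(rho + beta_par)^2, takes the A-B axis as the x axis, works in units with c = 1, and uses hypothetical input values. It compares the closed-form expression with an explicit Lorentz boost of the A-B separation into frame F.

```python
import math


def vqi_lower_bound(beta, beta_par, rho):
    """Lower bound on V_QI / c from equation (1).

    beta     : Earth-frame speed in frame F, in units of c
    beta_par : component of that velocity along the A-B axis, in units of c
    rho      : alignment c * t_AB / r_AB (|rho| < 1 for space-like separation)
    """
    return math.sqrt(1.0 + (1.0 - beta**2) * (1.0 - rho**2) / (rho + beta_par) ** 2)


def vqi_by_lorentz_boost(beta_vec, rho, r_ab=1.0):
    """Same quantity obtained by boosting the A-B separation into frame F (c = 1).

    The A-B axis is taken as the x axis; beta_vec is the Earth-frame velocity
    as seen from F, so Earth coordinates are boosted by +beta_vec.
    """
    beta_sq = sum(b * b for b in beta_vec)
    gamma = 1.0 / math.sqrt(1.0 - beta_sq)
    dr = (r_ab, 0.0, 0.0)
    dt = rho * r_ab
    b_dot_dr = sum(b * d for b, d in zip(beta_vec, dr))
    dt_f = gamma * (dt + b_dot_dr)                       # time separation in F
    coef = (gamma - 1.0) * b_dot_dr / beta_sq + gamma * dt
    dr_f = [d + coef * b for d, b in zip(dr, beta_vec)]  # spatial separation in F
    return math.sqrt(sum(x * x for x in dr_f)) / abs(dt_f)


# Hypothetical inputs chosen only for illustration.
beta_vec = (1e-4, 6e-4, 8e-4)
rho = 1e-5
beta = math.sqrt(sum(b * b for b in beta_vec))

from_formula = vqi_lower_bound(beta, beta_vec[0], rho)
from_boost = vqi_by_lorentz_boost(beta_vec, rho)
print(from_formula, from_boost)
```

The two values agree to numerical precision, which illustrates that the bound depends on the frame only through $\beta$ and $\beta_\parallel$, and that it grows rapidly as $\rho + \beta_\parallel$ approaches zero.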

To gain some intuition, we first consider the simple case in which 
$\rho = 0$ (the two events are perfectly simultaneous in the Earth frame) 
and the A-B axis is perfectly aligned in the east-west direction. Then, 
when the Earth rotates, there will be a moment $t_0$ when the east-west 
direction is perpendicular to $\mathbf{v}$, that is, $\beta_\parallel(t_0) = 0$. During a small time 
interval around $t_0$, we can bound $|\beta_\parallel(t)|$ by a small value, and thus 
obtain a high lower bound for $V_{QI}$. 

In principle, the alignment $\rho$ could be optimized for each privileged 
frame that we wish to test, so as to decrease the bound that can 
be put on $(\rho + \beta_\parallel)^2$ during the time interval $T$ (and increase the term 
$(1 - \rho^2)$ at the same time). In our experiment, however, because we 
want to scan all possible frames, we do not optimize $\rho$ for each frame; 
instead, we align the detection events such that $|\rho| \leq \bar{\rho} \ll 1$, where $\bar{\rho}$ 
is our experimental precision on the alignment $\rho$. We then use the 
fact that $(1-\rho^2)/(\rho+\beta_\parallel)^2 \geq (1-\bar{\rho}^2)/(\bar{\rho}+|\beta_\parallel|)^2$ to get the following 
bound: 

$$\left(\frac{V_{QI}}{c}\right)^2 \geq 1 + \frac{(1-\beta^2)(1-\bar{\rho}^2)}{(\bar{\rho}+|\beta_\parallel|)^2} \qquad (2)$$

The problem reduces to bounding $|\beta_\parallel|$ directly. 

In the configuration of our experiment, the A-B axis is almost, but 
not perfectly, oriented along the east-west direction. Consequently, 
the component $\beta_\parallel(t)$ has a 24-hour period, and geometric considerations 
show that it can be written as (see the Supplementary 
Information) 

$$\beta_\parallel(t) = \beta\cos\chi\sin\alpha + \beta\sin\chi\cos\alpha\cos\omega t$$

where $\chi$ is the zenith angle of $\mathbf{v}$, $\alpha$ is the angle between the A-B axis 
and the equatorial x-y plane (Fig. 1), and $\omega$ is the angular velocity of 
the Earth. 


Figure 1 | Reference frames. The Earth frame moves with respect to a 
hypothetically privileged reference frame F at a speed $v$. The zenith angle 
$\chi$ between $\mathbf{v}$ and the z axis can have values between 0° and 180°. The A-B axis 
forms an angle $\alpha$ with the equatorial (x-y) plane. $\omega$, angular velocity of the 
Earth. 
As we show in the Supplementary Information, in order to bound 
$|\beta_\parallel|$ from above during a period of time $T$, we can consider two cases: 
one in which $\mathbf{v}$ does not point close to a pole ($C_T|\tan\chi| > |\tan\alpha|$), and 
one in which it does ($C_T|\tan\chi| \leq |\tan\alpha|$). Here $C_T = \cos^2(\omega T/4)$, 
which is ~1 when $\omega T$ is small. There exists a time interval of length 
$T$ during which $|\beta_\parallel(t)|$ is bounded from above by 

$$\beta\sqrt{\sin^2\chi\cos^2\alpha - \cos^2\chi\sin^2\alpha}\;\frac{\omega T}{2}$$

in the first case and by 

$$\beta\left(|\cos\chi\sin\alpha| - |\sin\chi\cos\alpha|\cos\frac{\omega T}{2}\right)$$

in the second case. These bounds, together with equation (2), provide 
the desired lower bound for $V_{QI}$. 
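
The two-case bound and equation (2) can be combined in a few lines. This is a sketch under stated assumptions, not the authors' analysis code: the first-case bound is taken to be beta * sqrt(sin^2(chi) cos^2(alpha) - cos^2(chi) sin^2(alpha)) * omega*T/2, the second-case bound beta * (|cos(chi) sin(alpha)| - |sin(chi) cos(alpha)| cos(omega*T/2)), and C_T = cos^2(omega*T/4). The rotation rate uses the sidereal day, and the values of alpha and rho_bar are hypothetical placeholders, not the experiment's values.

```python
import math

OMEGA = 2.0 * math.pi / 86164.1  # Earth's rotation rate in rad/s (sidereal day; assumed value)


def beta_par(t, beta, chi, alpha, omega=OMEGA):
    """beta_parallel(t) = beta cos(chi) sin(alpha) + beta sin(chi) cos(alpha) cos(omega t)."""
    return beta * (math.cos(chi) * math.sin(alpha)
                   + math.sin(chi) * math.cos(alpha) * math.cos(omega * t))


def beta_par_bound(beta, chi, alpha, T, omega=OMEGA):
    """Upper bound on |beta_parallel| over the best window of length T (two cases)."""
    a = abs(math.cos(chi) * math.sin(alpha))
    b = abs(math.sin(chi) * math.cos(alpha))
    c_t = math.cos(omega * T / 4.0) ** 2
    if c_t * b > a:  # equivalent to C_T |tan chi| > |tan alpha|: v not close to a pole
        return beta * math.sqrt(b * b - a * a) * omega * T / 2.0
    return beta * (a - b * math.cos(omega * T / 2.0))


def vqi_bound(beta, beta_par_max, rho_bar):
    """Lower bound on V_QI / c from equation (2)."""
    return math.sqrt(1.0 + (1.0 - beta**2) * (1.0 - rho_bar**2)
                     / (rho_bar + beta_par_max) ** 2)


def window_max(t_start, beta, chi, alpha, T, omega=OMEGA, samples=200):
    """Largest sampled |beta_parallel| in the window [t_start, t_start + T]."""
    return max(abs(beta_par(t_start + T * j / samples, beta, chi, alpha, omega))
               for j in range(samples + 1))


# Hypothetical parameters for illustration only (not the experiment's values).
beta, alpha, T, rho_bar = 1e-3, math.radians(5.0), 360.0, 1e-5
for chi_deg in (0.5, 30.0, 90.0):
    chi = math.radians(chi_deg)
    bp = beta_par_bound(beta, chi, alpha, T)
    print(f"chi = {chi_deg:5.1f} deg: |beta_par| <= {bp:.3e}, "
          f"V_QI/c >= {vqi_bound(beta, bp, rho_bar):.3e}")
```

The helper `window_max` is included so that the analytic bound can be compared with sampled values of $\beta_\parallel(t)$ for a chosen window; frames pointing close to a pole give the weakest bound because $\beta_\parallel$ never crosses zero there.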

We now describe our experiment. In essence it is a large Franson 
interferometer. A source situated in our laboratory in Geneva emits 
entangled photon pairs using the standard parametric down-conversion 
process in a nonlinear crystal (here a continuous-wave laser 
pumps a waveguide in a periodically poled lithium niobate crystal). 
Using fibre Bragg gratings and optical circulators, each pair is deter- 
ministically split and one photon is sent through the Swisscom fibre 
optic network to Satigny, a village west of Geneva, and the other 
photon is sent to Jussy, a village east of Geneva. The two receiving 
stations, located in those two villages, are separated by a direct distance 
of 18.0 km (Fig. 2). We use energy-time entanglement, a form 
of entanglement well suited to quantum communication in standard 
telecommunications fibres. At each receiving station, the photons 
pass through identically unbalanced fibre-optic Michelson interferometers. 
The imbalance (~25 cm) is larger than the single-photon 
coherence length (~2.5 mm), meaning that any single-photon interference 
is avoided, but is much smaller than the pump laser coherence 
length (>20 m). Accordingly, when a photon pair is detected 
simultaneously in Satigny and Jussy, there is no information about 
which path (the long arm or the short arm) the photons took in 
their interferometers. But because the photons were also emitted 
simultaneously, they must both have taken the same long or short 
path. This indistinguishability leads, as always in quantum physics, to 
interference between the long-long and short-short paths. 
Continuously scanning the phase in one interferometer (at Jussy), 
while keeping the other one stable, produces a sinusoidal oscillation 
of the correlation between the photon detections at Satigny and Jussy 
(Fig. 3). The phases were controlled by the temperature of the fibre- 
based interferometers. 
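
The two length-scale conditions described above can be written as a simple check. The sketch below is purely illustrative; the function and key names are hypothetical, and the numbers are the approximate values quoted in the text (the pump coherence length being a lower limit).

```python
def franson_conditions(imbalance_m, photon_coherence_m, pump_coherence_m):
    """Check the two length-scale conditions for Franson-type two-photon interference."""
    return {
        # Imbalance longer than the photon coherence length: no single-photon fringes.
        "single_photon_interference_suppressed": imbalance_m > photon_coherence_m,
        # Imbalance shorter than the pump coherence length: long-long and
        # short-short paths remain indistinguishable.
        "long_long_and_short_short_coherent": imbalance_m < pump_coherence_m,
    }


# Values quoted in the text, in metres.
checks = franson_conditions(imbalance_m=0.25, photon_coherence_m=2.5e-3,
                            pump_coherence_m=20.0)
print(checks)
print(f"imbalance / photon coherence length: {0.25 / 2.5e-3:.0f}")
print(f"pump coherence length / imbalance:  >{20.0 / 0.25:.0f}")
```

Both conditions are met with roughly two orders of magnitude of margin on each side.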




Figure 2 | Experimental setup. The source sends pairs of photons from 
Geneva to two receiving stations through the Swisscom fibre-optic network. 
The stations are situated in two villages, Satigny and Jussy, that are 
respectively 8.2 and 10.7 km from Geneva. The direct distance between them 
is 18.0 km. At each receiving station, the photons pass through identically 
unbalanced Michelson interferometers and are detected by a single-photon 
InGaAs avalanche photodiode (APD) (id201, id Quantique). The length of 
the fibre going to Jussy is 17.5 km. The fibre going to Satigny is only 13.4 km 
long, so we added a fibre coil of 4.1 km (represented as a loop) to equalize the 
lengths of the fibres. Having fibres with the same length allows us to satisfy 
the condition of good alignment (ρ ≪ 1). δ indicates the scanning of the 
phase of the interferometer at Jussy. 

Figure 3 | Interference fringes. Interference fringes with a period T = 900 s 
obtained during a 4-hour measurement, fitted with a sinusoidal function 
yielding a visibility of 87.6% ± 1.1%. If we subtract the accidental 
coincidences, the (net) visibility climbs to 94.1% ± 1.0%. This result is 
significant because the period of the interference fringes remains stable for a 
very long time, which allows us to fit the entire measurement with a 
continuous fit and obtain a high visibility value. 


Interference fringes were recorded in many runs, which usually 
lasted several hours (up to 15 hours for the longest run (see Fig. 4)). 
By juxtaposing several of these measurement runs obtained over 
several weeks, we covered a 24-hour period with interference fringe 
periods of T = 360 s with visibilities well above the threshold (1/√2) 
set by the Clauser–Horne–Shimony–Holt Bell inequality’. The visibility 
is large enough to exclude any common-cause explanation. 
The correlations are thus due either to entanglement, as predicted by 
quantum physics, or to some hypothetical spooky action at a distance 
whose speed we wish to bound from below. Because long measure- 
ments of fringes with short fringe periods T are difficult to fit con- 
tinuously, we fitted the data over a time window corresponding to 
one-and-a-half fringes and scanned this time window, as explained in 
the Supplementary information. 
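
The link between fringe visibility and the Bell threshold can be illustrated with a short numerical sketch. It is not the fitting procedure used for the measurements: the data below are synthetic and noiseless, and the helper name `fit_visibility` and the mean level of 100 counts are assumptions made for illustration. The estimator projects the counts onto a sinusoid of known period, which is exact only for uniformly spaced samples covering a whole number of fringes. The sketch also uses the standard relation S = 2√2 V between the CHSH parameter and the visibility of sinusoidal correlations.

```python
import math

def fit_visibility(times, counts, period):
    """Estimate fringe visibility by projecting onto a sinusoid of known period.

    Exact only for uniformly spaced samples spanning a whole number of periods.
    """
    n = len(times)
    w = 2 * math.pi / period
    offset = sum(counts) / n
    a = 2 * sum(c * math.cos(w * t) for t, c in zip(times, counts)) / n
    b = 2 * sum(c * math.sin(w * t) for t, c in zip(times, counts)) / n
    return math.hypot(a, b) / offset

# Synthetic, noiseless coincidence counts (illustrative values only).
period = 900.0                              # fringe period in seconds, as in Fig. 3
true_visibility = 0.876                     # raw visibility quoted for Fig. 3
times = [10.0 * k for k in range(1440)]     # 4 hours sampled every 10 s = 16 fringes
counts = [100.0 * (1 + true_visibility * math.cos(2 * math.pi * t / period + 0.3))
          for t in times]

v = fit_visibility(times, counts, period)
chsh_threshold = 1 / math.sqrt(2)           # visibility needed to violate CHSH
s_value = 2 * math.sqrt(2) * v              # CHSH parameter; local bound is 2

print(f"fitted visibility: {v:.3f}")
print(f"threshold:         {chsh_threshold:.3f}")
print(f"CHSH S:            {s_value:.3f}")
```

With real, noisy data and a window of one-and-a-half fringes, a least-squares fit with free phase and period would be needed instead of this projection.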

The violation of the Bell inequality at all times of day makes it 
possible to calculate the lower bound for the speed of quantum 
information in any reference frame. This bound depends on the 
precision of the alignment in the actual experiment; see equation 
(2). We wished to have a good alignment (ρ ≪ 1), so the difference 
in the arrival times of the single photons t_AB was minimized. 

Figure 4 | Visibility fits. Visibility fits for several uninterrupted runs 
obtained at different times of the day. Together these runs cover each 
moment of the day at least twice. The limitations on the lengths of these 
measurements were due to the end of the cooling ramp and small 
instabilities in the set-up that produced short interruptions in the scan. 
Visibility values remain above the threshold value of 1/√2 (black line) set by 
the Clauser–Horne–Shimony–Holt Bell inequality at all times. 

First, the length of each fibre between the source and the single- 
photon detectors was measured. The long fibres (several kilometres in 
length) were measured using a single-photon optical time-domain 
reflectometer” and the short fibres (less than 500 m in length) were 
measured using an optical frequency-domain reflectometer’'. The 
fibre on the Satigny side was found to be shorter than the other by 
4.1 km. We added a fibre coil to the short side (represented as a loop in 
Fig. 2), reducing this difference to below 1 cm with an uncertainty of 
1 cm, which corresponds to a light travel time of 49 ps. To remove any 
doubt about where exactly the measurement took place, we adjusted 
the lengths of the fibres from the source to the fibre couplers inside 
each interferometer and also to the photodiodes (where the photons 
are detected). Hence, the configuration was totally symmetric. 

Next, we considered the chromatic dispersion in the fibres. 
Chromatic dispersion added an uncertainty in the arrival times, 
and because the entangled photons were anticorrelated in energy, 
their time delays were always opposite to each other, which always 
increased this uncertainty. Chromatic dispersion was measured to be 
18.2 ps nm⁻¹ km⁻¹ using a chromatic dispersion analyser”. For a 
spectral half-width of Δλ = 0.5 nm and twice the distance of 
17.5 km, this is equivalent to a 319-ps uncertainty. Thus, the overall 
uncertainty in the relative lengths of the fibres was, when expressed in 
light travel time, t_AB = 323 ps. This, together with the direct distance 
between the receivers, r_AB = 18.0 km, allowed us to estimate the precision 
of our alignment: |ρ| ≤ ρ̄ = 5.4 × 10⁻⁶ ≪ 1. 
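
The arithmetic in this paragraph can be checked with a few lines of Python. The sketch assumes that the two timing uncertainties are combined in quadrature and that the alignment parameter is the ratio of the light travel distance c·t_AB to the separation r_AB. Neither step is spelled out in this passage, but both reproduce the quoted values to within rounding.

```python
import math

C_VACUUM_M_PER_S = 299_792_458.0

# Values quoted in the text.
dispersion_ps_per_nm_km = 18.2   # measured chromatic dispersion
half_width_nm = 0.5              # spectral half-width of the photons
fibre_length_km = 17.5           # length of each fibre arm
t_length_ps = 49.0               # 1 cm length uncertainty as light travel time
r_ab_m = 18.0e3                  # direct distance between the receivers

# The photons are anticorrelated in energy, so their delays are opposite:
# the dispersion term is taken over twice the fibre length.
t_dispersion_ps = dispersion_ps_per_nm_km * half_width_nm * 2 * fibre_length_km

# Assumption: the two independent contributions add in quadrature.
t_ab_ps = math.hypot(t_dispersion_ps, t_length_ps)

# Assumption: alignment parameter = (c * t_AB) / r_AB.
rho_bar = C_VACUUM_M_PER_S * t_ab_ps * 1e-12 / r_ab_m

print(f"dispersion term: {t_dispersion_ps:.1f} ps")
print(f"combined t_AB:   {t_ab_ps:.1f} ps")
print(f"rho_bar:         {rho_bar:.2e}")
```

The combined value comes out slightly below the quoted 323 ps because the text rounds the dispersion term up to 319 ps before combining.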

Last, we used equation (2) to calculate a lower bound for V_QI. We 
used the value of ρ̄ just calculated, the period of time T = 360 s needed 
to observe a Bell violation (corresponding to the interference fringe 
period), and the angle formed by the axis between the two receiving 
stations (the A–B axis) and the equatorial (x–y) plane, α = 5.8°. The 
results are shown in Fig. 5, for certain hypothetically privileged frames. 
In Fig. 5a, we scan all possible directions χ, but set the Earth’s relative 
speed at β = 10⁻³. A lower bound for V_QI greater than 10,000 times the 
speed of light is found in any such reference frame. The imperfect east–west 
orientation (α ≠ 0) is responsible for the minimum values of the 
bound at angles χ near 0° and 180°. For smaller Earth speeds, the bound 
on V_QI is even larger. However, if β is very large, then the corresponding 
bound on V_QI is less stringent. To illustrate this, in Fig. 5b we set 
χ = 90°, that is, with v in the equatorial plane, and scan the velocity β. 
Indeed, for β ~ 1, the bound drops rapidly. Recall, however, that for 
large values of β we could in principle optimize the alignment ρ in the 
experiment to get a better bound on V_QI. For small values of β, our 
bound is limited by the inverse of our precision of alignment ρ̄. 

Figure 5 | Lower bounds for the speed of quantum information. a, Bound 
obtained for V_QI/c as a function of the angle χ, for β = 10⁻³. For χ ≤ α or 
χ ≥ 180° − α, the bound is obtained by considering the second case 
discussed in the text, whereas for α < χ < 180° − α, the bound is obtained 
by considering the first case. The bound at χ = 90° is V_QI = 54,000c. 
b, Bound obtained for V_QI/c as a function of the speed β, for χ = 90°. As β 
tends to zero, our bound on V_QI/c tends to 1/ρ̄. 
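
The two limits quoted for Fig. 5 can be reproduced with a short sketch. Equation (2) is not reproduced in this passage, so the functional form below is an assumption rather than the authors' expression. It was chosen because it tends to 1/ρ̄ as the Earth speed goes to zero and gives roughly 54,000c for a direction angle of 90° and a relative speed of 10⁻³. It also ignores the small tilt of 5.8° between the A–B axis and the equatorial plane. The function names are hypothetical.

```python
import math

SIDEREAL_DAY_S = 86_164.0

def beta_parallel_max(beta, chi_deg, window_s):
    """Assumed largest velocity component along the A-B axis during a window
    of length window_s centred on the moment the axis is perpendicular to v."""
    omega = 2 * math.pi / SIDEREAL_DAY_S        # Earth's rotation rate, rad/s
    return beta * math.sin(math.radians(chi_deg)) * omega * window_s / 2

def vqi_over_c_bound(beta, rho_bar, beta_par):
    """Assumed form of the lower bound on V_QI / c (not taken from the text)."""
    numerator = (1 - beta**2) * (1 - rho_bar**2)
    return math.sqrt(1 + numerator / (rho_bar + abs(beta_par))**2)

rho_bar = 5.4e-6      # alignment precision quoted in the text
window_s = 360.0      # fringe period needed to observe a Bell violation

bound_90 = vqi_over_c_bound(1e-3, rho_bar, beta_parallel_max(1e-3, 90.0, window_s))
bound_rest = vqi_over_c_bound(0.0, rho_bar, 0.0)

print(f"bound at 90 degrees, beta = 1e-3: {bound_90:,.0f} c")
print(f"bound as beta -> 0:               {bound_rest:,.0f} c")
```

In this form a shorter fringe period or a better alignment both raise the bound, which matches the qualitative discussion above.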

In conclusion, we performed a Bell experiment using entangled 
photons between two villages separated by 18 km and approximately 
east-west oriented, with the source located precisely in the middle. 
The rotation of the Earth allowed us to test all possible hypothetically 
privileged frames over a period of 24 hours. Two-photon interference 
fringes with visibilities well above the threshold set by the Bell 
inequality were observed at all times of day. From these observations 
we conclude that the nonlocal correlations observed here and in 
previous experiments’ are indeed truly nonlocal. To maintain an 
explanation based on spooky action at a distance we would have to 
assume that the spooky action propagates at speeds even greater than 
the bounds obtained in our experiment. 

Fullerenes from aromatic precursors by 
surface-catalysed cyclodehydrogenation 

F. Javier Palomares, Noemi Cabello, Miguel A. Basanta, José Ortega, Javier Méndez, Antonio M. Echavarren, 
Rubén Pérez, Berta Gómez-Lor & José A. Martín-Gago 


Graphite vaporization provides an uncontrolled yet efficient 
means of producing fullerene molecules. However, some fullerene 
derivatives or unusual fullerene species might only be accessible 
through rational and controlled synthesis methods. Recently, such 
an approach has been used’ to produce isolable amounts of the 
fullerene C60 from commercially available starting materials. But 
the overall process required 11 steps to generate a suitable poly- 
cyclic aromatic precursor molecule, which was then dehydro- 
genated in the gas phase with a yield of only about one per cent. 
Here we report the formation of C60 and the triazafullerene C57N3 
from aromatic precursors using a highly efficient surface- 
catalysed cyclodehydrogenation process. We find that after depos- 
ition onto a platinum (111) surface and heating to 750K, the 
precursors are transformed into the corresponding fullerene and 
triazafullerene molecules with about 100 per cent yield. We expect 
that this approach will allow the production of a range of other 
fullerenes and heterofullerenes*’, once suitable precursors are 
available. Also, if the process is carried out in an atmosphere con- 
taining guest species, it might even allow the encapsulation of 
atoms or small molecules to form endohedral fullerenes*”. 

The surface-catalysed cyclodehydrogenation process we use to 
transform complex organic polyaromatic precursors of suitable 
topology into targeted fullerene species is sketched in Fig. 1 (see also 
Supplementary Fig. 1). Through vacuum thermal evaporation, we 
first deposit the precursor (C57H33N3 (1) in the case of triazafullerene 
and C60H30 (2) in the case of fullerene) on a catalytically active metal 
surface. Subsequent annealing of the sample at 750 K induces a sur- 
face reaction that produces the corresponding closed molecule. 
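
The hydrogen balance implied by the two transformations can be tallied with a small helper. This is only a bookkeeping sketch: the function names are hypothetical, the parser handles only simple formulas such as those quoted here, and the atomic masses are standard rounded values.

```python
import re

ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007}  # g/mol, rounded

def parse_formula(formula):
    """Return element counts for a simple formula such as 'C57H33N3'."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + int(number or 1)
    return counts

def molar_mass(formula):
    """Molar mass in g/mol from the rounded atomic masses above."""
    return sum(ATOMIC_MASS[el] * n for el, n in parse_formula(formula).items())

def hydrogen_released(precursor, product):
    """Number of H atoms lost, checking that the C/N framework is conserved."""
    before, after = parse_formula(precursor), parse_formula(product)
    heavy_before = {el: n for el, n in before.items() if el != "H"}
    heavy_after = {el: n for el, n in after.items() if el != "H"}
    if heavy_before != heavy_after:
        raise ValueError("carbon/nitrogen framework is not conserved")
    return before.get("H", 0) - after.get("H", 0)

for precursor, product in [("C57H33N3", "C57N3"), ("C60H30", "C60")]:
    n_h = hydrogen_released(precursor, product)
    lost = molar_mass(precursor) - molar_mass(product)
    print(f"{precursor} -> {product}: {n_h} H atoms, {lost:.2f} g/mol lost")
```

Every hydrogen atom of each precursor must leave the molecule for the cage to close, which is why the process is described as a cyclodehydrogenation.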

The syntheses of the planar precursors used in this work follow a 
previously published methodology®* (see also Supplementary 
Information, section 1.1). Their molecular structures are optimized 
using two different ab initio total-energy methods (refs 9, 10; Supplementary 
Information, section 1.2); Fig. 1a shows a ball-and-stick model of 
the optimized geometry we obtain for 1. Compounds 1 and 2 are 
markedly twisted, with each of the three lobes adopting a helical 
shape because steric congestion of their bay positions forces the outer 
aromatic ring of each lobe to flip up with respect to the plane of the 
central ring. The twisted conformation can be described by the dis- 
tance the most external carbon atom is placed above the plane of the 
central aromatic ring (Δz), and by the twisting angle of each molecular 
branch. These values are respectively found to be 0.08 nm and 
20.5° for 1 and 0.09 nm and 30.5° for 2. 

We have probed the cyclodehydrogenation process using scanning 
tunnelling microscopy (STM), X-ray photoemission spectroscopy 


Figure 1 | Optimized geometrical structure of the C57H33N3 molecule at the 
different stages of the process. a, Representation of the optimized 
molecular structure of the C57H33N3 molecule. The wings of the molecule 
display a noticeable helicoidal twist. b, The same molecule adsorbed at room 
temperature (300 K) on the platinum (111) surface. The final relaxed 
configuration strongly depends on the adsorption site. The structure shown 
corresponds to one of the most energetically favourable sites, where the 
interaction with the metal substrate removes the twist and favours a 
markedly planar configuration for the molecule. c, The optimized structure 
for the C57N3 triazafullerene formed after the cyclodehydrogenation process 
at 750 K. Blue balls represent the nitrogen atoms in the structure. 

(XPS) and thermal programmed desorption (TPD) in an ultrahigh 
vacuum (UHV) environment. Figure 2a shows an STM image of 1 
deposited on a platinum (111) surface with the ball-and-stick model 
of the adsorbed molecule superimposed. The size and topology of the 
STM image, with three wings characterizing the molecular shape, 
confirm that the molecules do not fragment, but retain their planar 
structure upon adsorption. A more intense protrusion (lobe) is 
imaged in each of the wings. These lobes are not centred; rather, their 
positions correspond to those of the hexagonal external rings, as can 
be seen by comparing the superimposed molecule with the STM 
image in Fig. 2a. 

In a first approximation, we might speculate that the lobes corre- 
spond to the molecular orbitals of the free molecule. However, the 
detailed relaxed geometry is far from that of the isolated molecule: 
our ab initio calculations on a large 10 × 10 unit cell show that the 
interaction with the platinum surface is strong enough to deform the 
structure of the molecule. The competition between the π bonding of 
the rings with the surface and the energetic costs of the sterically 
induced molecular deformations results in a very complicated energy 
landscape, with barriers between local minima determined by the 
adsorption geometry. The general trend is a reduction of the molecular 
buckling (for example, Δz is reduced from 0.08 nm to 0.02 nm for 
1) as the molecular wings approach the surface to maximize the 
number of bonds with the surface (see also the relaxed configuration 
shown in Fig. 1b). The effect brings the hydrogen atoms on the wings 
closer to the surface, and thus plays an important role in the tem- 
perature-induced fullerene formation by surface-catalysed dehydro- 
genation. The strong molecule-surface interaction suggested by 
the calculations can also explain why we fail to observe long-range 
ordering of adsorbed molecules or preferential adsorption at step 
edges (which would require mobility of the molecule on the platinum 
surface). Similarly, the observation that in some imaged molecules 

Figure 2 | STM images of the cyclization process. a, The C57H33N3 
molecule deposited on the platinum (111) surface, with the atomic structure 
superimposed following the calculations. b, 10 × 10 nm² image of the 
platinum (111) surface after evaporation of about 0.2 monolayers of 
C57H33N3 molecules. c, An isolated C57N3 molecule obtained after annealing 
the surface at 750 K. The cyclization process is inferred from the appearance 
of different molecular orbitals and changes in the shape and size of the 
imaged molecule. d, 10 × 10 nm² image of the surface shown in b, but after 
annealing at 750 K. Inset, XPS spectrum of about 0.8 monolayers of 
C57H33N3 molecules after annealing at 750 K. The best-fit curve of the 
spectrum, together with the different components used in the fit, is also 
shown. 




NATURE|Vol 454|14 August 2008 


the different lobes have different apparent heights is consistent with 
our calculations, which indicate that the final molecular geometry 
strongly depends on the adsorption site. 

As deposited on the platinum (111) surface (Fig. 2a), the hetero- 
polyarene 1 (and similarly the polyarene 2) presents a total apparent 
height in the STM images of about 0.14 nm, with no significant 
dependence on bias voltage. The triangular base length of the mole- 
cule is about 2.2 nm. Upon annealing the sample at 750 K, a surface 
reaction takes place and the triangular molecules imaged on the 
surface transform into round molecules with an apparent height of 
about 0.38 nm and a diameter of 1.5 nm (Fig. 2c). Evaporation of 
commercial C60 on a platinum surface and imaging under the same 
experimental conditions results in very similar STM images for the 
adsorbed fullerenes (Supplementary Fig. 2). A height of about 0.4 nm 
and a diameter of about 1.35 nm have been reported for C60 deposited 
on a platinum (110) surface", and a height of about 0.31 nm for 
C60 deposited on a palladium (110) surface’’. In both these cases, the 
low height appears after surface annealing and suggests strong bond- 
ing of the molecule to the surface, in agreement with suggestions that 
the interaction of a fullerene with a platinum surface is strongly 
covalent and that the molecule tries to maximize the number of 
surface bonds'*. We also note that the molecular orbitals visible in 
the STM images are similar to those reported for C60 on platinum”, 
silicon'® and gold’® surfaces and are fairly well reproduced by our 
calculations (Supplementary Information, section 2.2); they might 
thus be considered as an indicator of fullerene formation. 

The cyclization process shows efficiency close to 100%; that is, all 
the adsorbed molecules are transformed into fullerenes 
(Supplementary Information, section 2.3). This is illustrated by the 
images in Figs 2b, d of the surface covered with about 0.2 monolayers 
of precursor before and after annealing, respectively: the molecular 
coverage does not change as a result of the temperature-induced 
reaction, with all of the triangular molecules converted into round 
fullerenes. 

Further evidence for the thermally induced cyclization of 1 is pro- 
vided by XPS spectra of the surface covered with about 0.8 monolayers 
of precursor (Fig. 2d inset). Even though this corresponds to a nitrogen 
coverage of only about 0.04 monolayers with respect to the platinum 
surface, weak emission from the nitrogen 1s core level is detected 
and confirms that after annealing, nitrogen is present on the surface 
(and is presumably associated with the unique topographic STM sur- 
face features, the fullerene molecules). Information on the chemical 
state of the nitrogen atoms is obtained from a core-level line-shape 
analysis of the nitrogen 1s spectrum, with its two peaks. The binding 
energy of the main peak at 400.6 eV corresponds to substitutional 
nitrogen in a graphite sheet, as reported for nitrogen bonded to carbon 
atoms in sp² hybridization’’. This is the nitrogen coordination in the 
triazafullerene we have formed. The binding energy of the small peak 
at 398.2 eV is related to PtNx compounds, in which nitrogen is chemisorbed'’. 
Therefore, the smaller peak can be attributed to nitrogen 
atoms in the fullerene cage interacting with the platinum surface. This 
core-level analysis thus indicates that about one-third of the nitrogen 
atoms of the triazafullerene C57N3 interact with the surface, whereas, 
as a consequence of the heterofullerene curvature (see Fig. 1c), the rest 
are not in contact with the surface. The XPS observations also confirm 
the 100% efficiency of the cyclodehydrogenation process because the 
nitrogen 1s intensity does not change significantly after annealing, 
indicating that all the deposited molecules have cyclized 
(Supplementary Information, section 2.5). 

The strong covalent interaction between the precursor molecules 
and the platinum surface appears important for the efficiency of the 
process. To test this idea, we evaporate 1 and 2 on a gold (111) surface 
that is quite inert and does not promote the cyclization reaction. The 
molecule–substrate interaction is negligible on this surface and, consequently, 
molecular diffusion at room temperature is enhanced, 
inhibiting the dehydrogenation process. The cyclization efficiency 
is reduced in this case to about 1%, with most of the molecules 
decomposing. In the test with polyarene 2, the few isolated C60 molecules 
that are detected on the gold surface are found to have molecular 
orbitals similar to those previously reported’. 

The thermally induced surface reaction generating fullerenes con- 
sists of a series of dehydrogenation reactions. Recently, dehydrogena- 
tion of adsorbed aromatic molecules was attributed to activation by the 
electronic current from the tip of a scanning tunnelling microscope’. 
To exclude such an activation mechanism and to confirm a surface- 
catalysed dehydrogenation process in our system, we measure the 
dihydrogen signal in a mass spectrometer as a function of annealing 
temperature. To avoid experimental artefacts due to dihydrogen 
desorption from the experimental apparatus, we synthesize 
2,3,7,8,12,13-hexadeutero-10,15-dihydro-5H-diindolo[3,2-a:3’,2’-c] 
carbazole (C57H27D6N3, or 1-d6), a deuterated analogue of C57H33N3 
(1) incorporating six deuterium atoms at strategic positions. Figure 3 
shows the evolution of m/z 3 (HD) and m/z 4 (D2) mass signals as a 
function of temperature. The baseline corresponds to the m/z 3 and 4 
mass signals recorded when annealing the clean platinum surface (that 
is, before any molecular deposition) up to 750 K. The mass spectrum 
shows that a rapid desorption process takes place at about 700 K and 
should be associated with the molecular transformation shown in 
Fig. 2. The STM observations agree with the mass spectrometry results 
(Fig. 3), which indicate that there is neither modification of the 
molecular shape nor dehydrogenation below 400 K and that we could 
have partial dehydrogenation at about 500 K. STM images recorded 
after annealing between 400 and 600 K show asymmetrical molecules 
that could correspond to intermediates, that is, partially closed full- 
erenes that could be regarded as open-cage fullerenes”®”’. 

The various experimental observations presented above provide 
insight into the cyclization mechanism. The observed formation of 
HD (m/z 3) can be attributed to the exchange of H2 and D2, which is 
known to be catalysed by platinum with an activation energy of 
0.23 eV (ref. 22) and is enhanced in our system owing to the relatively 
high temperature of the dehydrogenation process. Therefore, hydro- 
gen and deuterium are gathered by the platinum surface, where they 
diffuse, react with each other and finally desorb as H2, HD and D2. The 
platinum-induced dehydrogenation of the precursor that produces 

Figure 3 | TPD experiment of the cyclodehydrogenation process. 
Temperature evolution of the masses corresponding to HD (m/z 3) and D2 
(m/z 4), recorded after depositing C57H27D6N3 on the surface. Thermal 
desorption takes place at around 500 K. The m/z 4 signal has been scaled by a 
factor of eight. The black line is the result of recording the HD signal without 
depositing molecules on the platinum surface. 

the hydrogen and deuterium atoms is facilitated by the strong interaction 
of the molecule with the platinum surface. According to our 
density functional theory calculations, this interaction distorts the 
molecule in such a way that the molecular wings approach the surface 
and enable platinum-catalysed dissociation of hydrogen atoms from 
the adsorbed molecule (Fig. 1b). Conversely, if the molecule does not 
interact strongly with the surface, as in the case of a gold (111) surface, 
cyclodehydrogenation does not take place. After consecutive release of 
the hydrogen atoms to the surface, the molecule is assumed to spon- 
taneously cyclize. We note that although a complete density functional 
theory simulation of this complex process is not possible, our theoretical 
calculations for the C60H30 free molecule provide evidence that 
dehydrogenation induces a strong tendency towards cyclization. The 
removal of hydrogen atoms close to the central ring triggers confor- 
mational changes such that the molecular wings approach each other 
and a fullerene semi-cage is formed in a process that proceeds without 
an energy barrier. Furthermore, a thorough study of the completely 
dehydrogenated molecule shows that the different possible paths lead- 
ing from the planar conformation to the complete fullerene cage have 
energy barriers less than 0.3 eV, which can be easily overcome with the 
thermal energy available during annealing (Supplementary 
Information, section 2.4). 
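
To give a sense of scale for the statement that barriers below 0.3 eV are easily overcome, the thermal energy and a simple Arrhenius estimate can be computed. The attempt frequency of 10^13 per second is an assumed, typical order of magnitude and does not come from the calculations reported here, so the resulting rates are rough illustrations only. The function names are hypothetical.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5   # Boltzmann constant in eV/K

def thermal_energy_ev(temperature_k):
    """Thermal energy k_B * T in eV."""
    return BOLTZMANN_EV_PER_K * temperature_k

def arrhenius_rate_hz(barrier_ev, temperature_k, attempt_hz=1e13):
    """Arrhenius rate; attempt_hz is an assumed prefactor, not a fitted value."""
    return attempt_hz * math.exp(-barrier_ev / thermal_energy_ev(temperature_k))

barriers_ev = {
    "cage closure (upper limit quoted)": 0.30,
    "H2/D2 exchange on platinum": 0.23,
}

for temperature_k in (300.0, 750.0):
    print(f"T = {temperature_k:.0f} K, k_B T = {thermal_energy_ev(temperature_k):.4f} eV")
    for label, barrier_ev in barriers_ev.items():
        rate = arrhenius_rate_hz(barrier_ev, temperature_k)
        print(f"  {label}: about {rate:.1e} per second")
```

Under these assumptions the closing steps are fast at the annealing temperature, which is consistent with the picture that removal of hydrogen, not the subsequent folding, controls the transformation.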

To explore both the feasibility of our controlled synthesis method 
outside a UHV environment and its potential for scale-up, we use 
platinum nanopowder instead of a single-crystal substrate. In 
this experiment, compound 2 is deposited from a CHCl3 solution 
onto a large excess of platinum nanopowder and heated under 
vacuum (10 “ mbar) at 700K for 20 min. For this sample, the laser 
desorption–ionization time-of-flight mass spectrum in negative-ion 
mode clearly shows the presence of C60 (Supplementary Information, 
section 2.6). 


METHODS SUMMARY 


The syntheses of C60H30 and C57H33N3 were performed by alkylation of 10,15- 
dihydro-5H-diindeno[1,2-a:1’,2’-c]fluorene (truxene) and, respectively, 10,15- 
dihydro-5H-diindolo[3,2-a:3’,2’-c]carbazole (5,10,15-triazatruxene)**. 
Cyclization experiments have been performed in situ in a UHV system with a 
base pressure of 1 × 10⁻¹⁰ mbar equipped with Auger electron spectroscopy, 
low-energy electron diffraction optics, STM at room temperature (300 K) and 
TPD. The molecules were deposited in situ by thermal evaporation on a prev- 
iously atomically cleaned platinum (111) surface. Calculations are based on 
density functional theory as implemented in two different total-energy pseudo- 
potential codes: FIREBALL (ref. 9), where a local orbital basis is used to expand 
the electronic wavefunctions, and the plane-wave-basis approach CASTEP (ref. 
10). Laser desorption–ionization time-of-flight experiments were performed 
using a Bruker Daltonics Autoflex mass spectrometer equipped with a nitrogen 
laser (337 nm). The instrument was operated in negative-ion, reflectron mode. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


The syntheses of C60H30 and C57H33N3 were performed by alkylation of 10,15- 
dihydro-5H-diindeno[1,2-a:1’,2’-c]fluorene (truxene) and, respectively, 10,15- 
dihydro-5H-diindolo[3,2-a:3’,2’-c]carbazole (5,10,15-triazatruxene) with 
1-bromo-2-bromomethylnaphthalene, followed by a triple palladium-catalysed 
arylation as previously reported**. The synthesis of the hexadeuterated analogue 
C57H27D6N3 was performed according to the same procedure but starting from a 
hexadeutero-5,10,15-triazatruxene, obtained by reductive deuteriodebromina- 
tion of hexabromo-5,10,15-triazatruxene with formic acid-d, and Et3N. 

Calculations were based on density functional theory as implemented in two 
different total-energy pseudopotential codes: FIREBALL (ref. 9), where a local orbital 
basis is used to expand the electronic wavefunctions, and CASTEP (ref. 10), a 
plane-wave-basis approach. FIREBALL provides the balance of accuracy and efficiency 
needed to perform calculations with the extremely large unit cells (involving 
almost 400 atoms) that are required to study the adsorption of the planar full- 
erenes on the metal substrates. Its reliability has been checked against CASTEP 
results for the different free molecules. 

To model the adsorption of the planar fullerene-precursors on the platinum 
(111) surface, we considered a 10 × 10 periodic slab including the molecule and 
three metal layers. Only the Γ point was included in the sampling of the Brillouin 
zone. The FIREBALL code with the local density approximation for the 
exchange-correlation functional was used for these calculations. All atoms in 
the molecule and the two upper metal layers were allowed to relax to their 
ground-state configuration with convergence criteria for the total energy and 
forces of 10 *eV and 0.05 eV Å⁻¹, respectively. 

UHV experiments were performed in a system that had a base pressure of 
1 × 10⁻¹⁰ mbar, and Auger electron spectroscopy, low-energy electron diffraction, 
STM at room temperature and TPD techniques were used. 

We prepared the clean platinum (111) surface through repeated cycles of Ar⁺ 
bombardment and annealing at up to 1,250 K for 15 min in an O2 atmosphere 
(1 X 10° mbar). The last annealing cycle was performed after removing the O2 
atmosphere from the chamber and keeping the total pressure at around 
10⁻¹⁰ mbar. To prepare the clean gold (111) surface, we used the same methodology 
but annealed at 700 K without O2 atmosphere in UHV (pressure during 
annealing, 4 × 10⁻¹⁰ mbar). To check for surface cleanliness we used Auger electron 
spectroscopy, low-energy electron diffraction and STM techniques. Typical 
values for STM images were about +500 mV (on the sample) and 0.2 nA. XPS 
spectra were recorded in a different system equipped with a SPECS Phoibos150 
electron spectrometer and a delay-line detector in the nine-segment mode, using 
monochromatic Al Kα radiation (1486.74 eV). The pressure in the analysis 
chamber was less than 5 × 10⁻¹⁰ mbar. High-resolution XPS spectra, such as 
that shown in Fig. 2, were obtained with a pass energy of 10 eV, which provided 
an overall energy resolution of 0.39 eV as measured at the Fermi edge of a silver 
reference sample. 

To evaporate the organic molecules we used a homemade tantalum dispenser, 
which was calibrated with a thermocouple to fix the evaporation temperature 
accurately at 675 K. The molecular source was previously degassed without 
molecules up to 1,000 K in high vacuum (1X 10 ’mbar). The C60H30, 
C57H33N3, C57H27D6N3 and commercial C60 (98% purity; Sigma) molecules 
were degassed at 660 K in UHV (3 × 10⁻¹⁰ mbar) for two days to eliminate 
any organics used in the synthesis. 

We performed the synthesis of C60 over platinum nanopowder by dropping a 
solution of 2 (0.5 mg) in CHCl3 over 20 mg of activated platinum powder 
(approximate particle size, 100 nm; Aldrich). The solvent was evaporated under 
a nitrogen flow and the mixture was heated at 700 K in a Carbolite CTF/12/65/ 
550 tube furnace with a temperature controller device for 20 min under vacuum 
conditions (10° * mbar). 

North American ice-sheet dynamics and the onset of 
100,000-year glacial cycles 


R. Bintanja & R. S. W. van de Wal 


The onset of major glaciations in the Northern Hemisphere about 
2.7 million years ago’ was most probably induced by climate cool- 
ing during the late Pliocene epoch”’. These glaciations, during 
which the Northern Hemisphere ice sheets successively expanded 
and retreated, are superimposed on this long-term climate trend, 
and have been linked to variations in the Earth’s orbital para- 
meters*. One intriguing problem associated with orbitally driven 
glacial cycles is the transition from 41,000-year to 100,000-year 
climatic cycles that occurred without an apparent change in 
insolation forcing’. Several hypotheses have been proposed to 
explain the transition, both including and excluding ice-sheet 
dynamics*"°. Difficulties in finding a conclusive answer to this 
palaeoclimatic problem are related to the lack of sufficiently long 
records of ice-sheet volume or sea level. Here we use a compre- 
hensive ice-sheet model and a simple ocean-temperature model” 
to extract three-million-year mutually consistent records of sur- 
face air temperature, ice volume and sea level from marine benthic 
oxygen isotopes’”. Although these records and their relative phas- 
ings are subject to considerable uncertainty owing to limited avail- 
ability of palaeoclimate constraints, the results suggest that the 
gradual emergence of the 100,000-year cycles can be attributed to 
the increased ability of the merged North American ice sheets to 
survive insolation maxima and reach continental-scale size. The 
oversized, wet-based ice sheet probably responded to the sub- 
sequent insolation maximum by rapid thinning through increased 
basal-sliding’*", thereby initiating a glacial termination. Based on 
our assessment of the temporal changes in air temperature and ice 
volume during individual glacials, we demonstrate the importance 
of ice dynamics and ice—climate interactions in establishing the 
100,000-year glacial cycles, with enhanced North American ice- 
sheet growth and the subsequent merging of the ice sheets being 
key elements. 

This onset of major glaciations has been inferred primarily from 
analyses of marine oxygen isotope ratio (δ¹⁸O) records'®. The glaciations 
have been confidently linked to variations in the Earth’s orbital 
parameters—that is, precession of the equinoxes (with periods of 
19,000 and 23,000 years; 19 and 23 kyr), axial obliquity (41 kyr) 
and orbital eccentricity (100 kyr)*. However, difficulties associated 
with the theory of orbitally driven glacial cycles remain. One of the 
more intriguing problems relates to the observation that marine δ¹⁸O 
varied mainly at obliquity timescales before ~1 Myr ago” and at 100- 
kyr timescales thereafter. This transition from 41-kyr to 100-kyr 
climatic cycles is generally known as the mid-Pleistocene transition 
(MPT), and occurred without an apparent change in insolation for- 
cing. Numerous explanations have been proposed for this change, 
such as a nonlinear ice-sheet response to a long-term cooling trend’, 
spatial and temporal changes in ice-sheet substratum’, sea-ice effects 
on ablation and accumulation’, orbital-inclination forcing induced 


by extraterrestrial dust’, and a change in the margin of the Antarctic 
ice sheet from land-based to marine-based”’. 

The main difficulty in finding conclusive answers to this
long-standing palaeoclimatic question is that sufficiently long
records of ice-sheet volume or sea level are currently unavailable.
Hence, existing hypotheses are mostly based on benthic δ¹⁸O records.
However, these are influenced not only by the mean oceanic δ¹⁸O
(governed by Northern Hemisphere ice-sheet storage) but also by local
deep-water temperature, and marine δ¹⁸O records may therefore be poor
indicators of ice-sheet fluctuations in terms of their volume and
phase lag owing to the temperature overprint.

We have applied a recently developed model-based method to separate
these two components, which involves an inverse technique in
conjunction with an ice-sheet model coupled to a simple deep-water
temperature model (Supplementary Information and Methods). Using the
LR04 stack marine δ¹⁸O record, we have reconstructed mutually
consistent 3-Myr time series of surface air temperature (continental
and annual mean between 40° and 80° N), ice-sheet volume and sea
level. We have estimated the phasing between them to deduce causes
and effects, and infer how the climate variables and phasings have
changed over the MPT. These reconstructions are subject to
considerable uncertainty, in particular before 0.4 Myr ago, which is
due mainly to limited palaeoclimatic constraints (Supplementary
Information) and to the simplicity of the ocean-temperature model
(and other assumptions).

The reconstructed marine isotope signal increased steadily over the
past 3 Myr (Fig. 1a), indicative of both climatic cooling (Fig. 1b)
and growing ice sheets (Fig. 1c) (Supplementary Data). Its amplitude
(reflecting the contrast between interglacial and glacial states)
also increased with time, with noticeable changes occurring around
2.7 Myr ago, indicating the onset of Northern Hemisphere glaciation
(Fig. 1c), and again around 1 Myr ago (MPT). The inferred overall
3-Myr cooling trend is generally consistent with the gradual cooling
of alkenone-derived equatorial sea surface temperature from deep-sea
sediment cores, although our mid-latitude/subarctic cooling is
stronger, most probably implying an increase in meridional
temperature gradient over the 3-Myr period. Our reconstructed
deep-sea cooling of about 2–2.5 °C agrees with Mg/Ca deep-sea
temperature estimates (Supplementary Information).

According to our calculations, the inception of the Northern
Hemisphere ice sheets at about 2.7 Myr ago may be linked to air
temperatures dropping below −5 °C (relative to the present), with
summer temperatures over inception-sensitive regions being the
critical factor. The modelled timing of Northern Hemisphere inception
coincides with the first occurrence of ice-rafted debris in the North
Atlantic, which indicates calving ice sheets bordering the ocean. The
amplitude of the temperature fluctuations hardly increased (Fig. 1b)
during this inception period, suggesting that the emerging Northern
Hemisphere ice sheets accounted for the enhanced δ¹⁸O variability
around 2.7 Myr ago.

¹Royal Netherlands Meteorological Institute (KNMI), Wilhelminalaan 10, 3732 GK De Bilt, The Netherlands. ²Institute for Marine and Atmospheric Research Utrecht (IMAU), Utrecht

Following Northern Hemisphere glacial inception, a more gradual
cooling facilitated steady ice-sheet expansion over the past 2.5 Myr
(in the long-term average; thick lines in Fig. 1a, b). Before the
MPT, the Eurasian (EAS) ice sheets contributed most to the total
Northern Hemisphere ice volume; then, with the appearance of 100-kyr
glacials, the North American (NAM) ice sheets took over. According to
our model calculations, the EAS and NAM ice sheets on average stored
19.0 and 15.8 m sea level equivalent (m.s.l.e.), respectively,
between 1 and 2 Myr ago, and 25.9 and 29.6 m.s.l.e. in the past
1 Myr, suggesting that the MPT somehow involved a change in EAS and
NAM ice-sheet characteristics. With the emergence of the 100-kyr
cycles during the MPT, the δ¹⁸O signal again exhibits a marked
increase in amplitude. Our results suggest that this can be
attributed mainly to an increase in the glacial–interglacial range of
ice volume and global sea level (Fig. 1c), as air temperature
fluctuations exhibit only a modest increase in amplitude.
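
The shift in dominance from the Eurasian to the North American ice sheets can be read directly off the four mean volumes quoted above. The short sketch below only redoes that arithmetic; the function and variable names are illustrative and are not taken from the authors' model.

```python
# Mean ice storage in metres of sea level equivalent (m.s.l.e.),
# copied from the paragraph above.
pre_mpt = {"EAS": 19.0, "NAM": 15.8}    # between 1 and 2 Myr ago
post_mpt = {"EAS": 25.9, "NAM": 29.6}   # past 1 Myr


def share(volumes, region):
    """Fraction of the combined EAS + NAM volume held by one region."""
    return volumes[region] / sum(volumes.values())


def growth(region):
    """Increase in mean stored volume across the MPT, in m.s.l.e."""
    return post_mpt[region] - pre_mpt[region]


for region in ("EAS", "NAM"):
    print(f"{region}: +{growth(region):.1f} m.s.l.e., "
          f"share {share(pre_mpt, region):.0%} -> {share(post_mpt, region):.0%}")
```

On these figures the North American gain (13.8 m.s.l.e.) is twice the Eurasian gain (6.9 m.s.l.e.), and the North American share of the combined volume moves from below to above one half.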

Most theories about the causes and origin of ice ages (glacials),
including those involving the MPT, rely heavily on assumptions
regarding the phasing of temperature, ice volume and δ¹⁸O
fluctuations relative to each other and to insolation components.
Here we estimate these phasings using Blackman–Tukey cross-spectral
analysis to determine the phase and coherence spectra relative to the
21 June insolation at 65° N. The results suggest that temperature
exhibited a relatively small lag relative to insolation, which
increased from 1.5 kyr at 2.5 Myr ago to 4.5 kyr at present
(Fig. 2a), presumably caused by the thermal effects of expanding ice
sheets. Because temperature and CO₂ most probably varied roughly in
phase, our results indicate that obliquity forced Northern Hemisphere
temperatures (most probably through mechanisms associated with the
global carbon cycle), which then drove the Northern Hemisphere ice
sheets. Global ice volume lagged behind insolation and temperature by
several millennia, with the EAS ice sheets exhibiting the slowest
response because of colder and drier ambient conditions.
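
To make the phase estimate concrete, the following is a minimal sketch of a Blackman–Tukey style lag estimate at a single period: a truncated cross-covariance (30% of the record length, as in Fig. 2) is tapered and Fourier transformed, and the phase is converted to a time lag. The Hann lag window, the function names and the synthetic 41-kyr test series are assumptions made for illustration; they are not the authors' implementation or data.

```python
import cmath
import math


def cross_covariance(x, y, max_lag):
    """Biased cross-covariance c_xy(k) = (1/n) * sum_t x'[t] * y'[t + k]
    for k = -max_lag..max_lag, where primes denote mean-removed series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    xs = [v - mx for v in x]
    ys = [v - my for v in y]
    cov = {}
    for k in range(-max_lag, max_lag + 1):
        t_lo, t_hi = max(0, -k), min(n, n - k)
        cov[k] = sum(xs[t] * ys[t + k] for t in range(t_lo, t_hi)) / n
    return cov


def bt_lag(x, y, dt, period, lag_fraction=0.3):
    """Lag of y behind x (in the units of dt) at the given period."""
    max_lag = int(lag_fraction * len(x))
    cov = cross_covariance(x, y, max_lag)
    freq = 1.0 / period
    s_xy = 0j
    for k, c in cov.items():
        # Hann lag window (an assumed choice of taper).
        window = 0.5 * (1.0 + math.cos(math.pi * k / max_lag))
        s_xy += window * c * cmath.exp(-2j * math.pi * freq * k * dt)
    # If y(t) = x(t - tau), the cross-spectrum phase is -2*pi*f*tau.
    return -cmath.phase(s_xy) / (2.0 * math.pi * freq)


# Synthetic check: a 41-kyr "forcing" and a copy delayed by 5 kyr,
# sampled every 1 kyr over a 500-kyr window.
forcing = [math.sin(2 * math.pi * t / 41.0) for t in range(500)]
response = [math.sin(2 * math.pi * (t - 5.0) / 41.0) for t in range(500)]
lag = bt_lag(forcing, response, dt=1.0, period=41.0)
print(f"estimated lag: {lag:.2f} kyr")
```

The phase is only defined modulo one cycle, so lags longer than half the chosen period would alias; the lags discussed here (a few kyr at 41 kyr) are well inside that limit.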

The average lag of ice volume behind surface air temperature 
decreased from about 6 kyr at 2.5 Myr ago to slightly more than
3 kyr at present (Fig. 2a), with the strongest decrease occurring after 
the MPT. According to our calculations, the Northern Hemisphere 
needed to cool about 5 °C before ice-sheet inception occurred. Also, 
ice sheets cannot expand faster than the rate at which mass is gained 
through snow accumulation. Both factors contribute to ice volume 
lagging temperature by 5–7 kyr during inception stages, both for 41-kyr
and 100-kyr glaciations, indicating that ice-sheet growth merely
followed the temperature signal. The only exception to ice volume 
lagging air temperature involves 100-kyr deglaciations; the associated 
lag dropped to about zero during the MPT (Fig. 2b). This suggests the 
existence of two ‘regimes’: before the MPT, deglaciations of small ice 


[Figure 1 plot: panels a–c against time (Myr BP, 0 to 3.0), with stages marked along the top (100-kyr glacials, 41-kyr glacials, NH inception). Vertical axes: isotopic value (‰), surface air temperature (°C) and ice volume (m.s.l.e.), each relative to present.]


Figure 1 | Three-million-year time series of marine oxygen isotope values,
subarctic surface air temperature and ice volume. Thick lines represent
400-kyr running means. a, Input marine oxygen isotope (δ¹⁸O) stack
record with the present-day value of 3.22‰ subtracted (red), and the
reconstructed ice-sheet (blue) and deep-water temperature (green)
components. b, Reconstructed surface air temperature (annual average over
Northern Hemisphere continents between 40° and 80° N) and deep-ocean
temperature. c, Reconstructed sea level or global ice volume (green), and the
Eurasian (red) and North American (blue) contributions. Coloured lines/
arrows along top represent various stages of glaciation. MPT,
mid-Pleistocene transition; NH, Northern Hemisphere; BP, before present;
m.s.l.e., m sea level equivalent.
Figure 2 | Phase lags of input/reconstructed variables, as evaluated using
Blackman–Tukey cross-spectral analysis with 30% lags. Data points
represent 500-kyr windows. a, Phase lag at 41 kyr (obliquity) of Northern
Hemisphere surface air temperature, Eurasian (EAS) ice volume, North
American (NAM) ice volume, global sea level or total ice volume, and total
marine oxygen isotope (δ¹⁸O) value, relative to insolation (65° N, 21 June).
For the obliquity spectral component, the coherence is always above the 95%
confidence level for all variables considered, whereas the reconstructed
records are generally incoherent with respect to the precession component
and also to the eccentricity components (before 1.2 Myr ago). As the tuning
of the LR04 stack emphasizes obliquity, the associated lag is primarily
determined by the assumed time constant of the simple ice model that was
used for tuning. The lag of δ¹⁸O relative to insolation increased between 2.5
and 1.5 Myr ago, after which it remained largely constant, as this was
prescribed as such in the tuning of the LR04 stack. Lags and leads of all
other variables relative to δ¹⁸O are independent of this tuning procedure,
but are most probably associated with considerable uncertainty owing to the
limited availability of palaeoclimate constraints and to the simplicity of
mainly the ocean model. b, Upper four curves (left axis) show phase lag of
global ice volume to Northern Hemisphere surface air temperature for 41-kyr
inception and deglaciation stages (light and dark blue) and for 100-kyr
inception and deglaciation stages (red and orange). Lower two curves (right
axis) show spectral power of 100-kyr Northern Hemisphere surface air
temperature (black) and global sea level or ice volume (green) variations.


sheets linearly followed temperature, whereas after the MPT the huge 
ice sheets were actively involved in the rapid, nonlinear deglaciations 
to terminate the 100-kyr glacials. Even though they were ultimately 
triggered by insolation, inceptions were probably governed by non- 
ice-sheet climate feedbacks to decrease temperature below the 
threshold of ice-sheet inception, whereas glacial terminations pre- 
sumably actively involved the rapid break-up of vulnerable large ice 
sheets, both agreeing with phase relationships over the last 360 kyr
derived from Antarctic ice cores.

We note that the resulting phasings of reconstructed variables
relative to insolation are largely determined by the choices made to
construct the age model of LR04 (which determines the phase between
δ¹⁸O and insolation). However, the 1.5–4.5-kyr lag between
reconstructed air temperature and insolation (Fig. 2a) agrees well
with the response timescale of the important feedback mechanisms in
the climate system (ocean, carbon cycle, ice sheets), suggesting that
the phasings assumed in LR04 are realistic. Also, the reconstructed
lags to insolation vary in time in a manner not prescribed by LR04.
These and other considerations (see Supplementary Information)
indicate that our analysis provides a picture of the phasings between
climate variables and insolation that is consistent overall.

Figure 3 | Time series of ice volume and Northern Hemisphere subarctic
surface air temperature between 1.5 and 0.5 Myr ago, across the MPT. Left
axis, ice volume; global, Eurasian (EAS) and North American (NAM)
contributions are shown as black, red and dark blue curves, respectively.
Right axis, Northern Hemisphere subarctic surface air temperature (green
curve). Vertical black arrows depict inferred 100-kyr interglacials.
Horizontal dark blue bars represent the first NAM ice volume minimum
following a 100-kyr interglacial. The horizontal light blue line represents the
45 m.s.l.e. level beyond which the Cordilleran and Laurentide ice sheets in
North America merged (Supplementary Information), which occurred only
after 1 Myr ago. Orange lines connect maxima in global ice volume within a
single 100-kyr glacial, highlighting increasing ice volumes during a glacial.

Our results support other analyses in showing that the MPT represents
a fairly gradual increase in 100-kyr power (Fig. 2b) from (roughly)
1.4 to 0.6 Myr ago. An important feature is the increased inability
of the system to reach full interglacial levels at the obliquity and
precession timescales (Fig. 3), in particular in North America where
continuing cooling enabled ice sheets to overcome insolation maxima
at ever greater volumes. This produced further expansion and eventual
merging (at about 45 m.s.l.e.) of the Cordilleran and the Laurentide
ice sheets. In Eurasia, the post-MPT ice volume was still limited to
about 40 m.s.l.e., presumably because the Atlantic Ocean in the west
and the dry and cold Siberian plains to the east prevented further
expansion of the Fennoscandian ice sheet.

The appearance of full-grown, merged ice sheets in North America
coincided with the deglaciation lag between surface air temperature
and ice volume being reduced to about zero (Fig. 2b), suggesting a
leading role for the NAM ice sheets in glacial terminations. We argue
that this large NAM ice volume initiated rapid deglaciations
involving large ice cap instabilities, because similar insolation
maxima apparently failed to trigger deglaciations of less extensive
ice sheets. Thick, wet-based ice sheets are particularly sensitive to
climate warming. Hence, once the NAM ice sheets reached a certain
threshold size (about 70 m.s.l.e.) during the post-MPT glacials, the
following insolation maximum and associated air temperature peak
triggered a rapid thinning of the ice sheet through increased sliding
rates, presumably via fast-flowing ice streams and possibly enhanced
by ablation, sea level and ocean circulation feedbacks (Supplementary
Information).

Even though our 3-Myr reconstructions of temperature, ice volume and
sea level should be considered first-order estimates (mainly owing to
the limited availability of constraining palaeoclimate data), they
nevertheless suggest that differences between the NAM and EAS ice
sheets were central to the gradual emergence of the 100-kyr glacials
between 1.4 and 0.6 Myr ago. Whereas the dry and sluggish EAS ice
sheets dominated before the MPT, increased climate cooling enabled
the NAM ice sheets to expand and eventually combine into one large
ice sheet that ultimately became unstable, triggering rapid glacial
terminations. The duration of the post-MPT glacials of 80–120 kyr
(2–3 obliquity cycles) was essentially governed by the rate at which
the NAM ice sheets expanded. According to this hypothesis, specific
ice dynamics and ice–climate interactions related to the North
American ice sheets played a crucial role in establishing the
100-kyr glacial cycles.
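
The correspondence between the 80–120-kyr glacial durations and two to three obliquity cycles is simple arithmetic on the 41-kyr period quoted earlier. A small sketch, with illustrative names only:

```python
OBLIQUITY_KYR = 41.0                 # obliquity period quoted in the text
GLACIAL_RANGE_KYR = (80.0, 120.0)    # post-MPT glacial durations quoted in the text


def duration(n_cycles, period=OBLIQUITY_KYR):
    """Length in kyr of a glacial spanning n_cycles obliquity cycles."""
    return n_cycles * period


def cycles_spanned(duration_kyr, period=OBLIQUITY_KYR):
    """Number of obliquity cycles that fit in a given duration."""
    return duration_kyr / period


for n in (2, 3):
    print(f"{n} obliquity cycles = {duration(n):.0f} kyr")
for d in GLACIAL_RANGE_KYR:
    print(f"{d:.0f} kyr = {cycles_spanned(d):.2f} obliquity cycles")
```

Two and three obliquity cycles give 82 and 123 kyr, which bracket the quoted range; their average, 102.5 kyr, is close to the nominal 100-kyr cycle length.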


METHODS SUMMARY 


The key to separating the LR04 stack marine δ¹⁸O record into actual
climate data is the observation that the two main components
affecting the marine δ¹⁸O value (storage in ice sheets and deep-water
temperature) both rely heavily on surface air temperature (with
regard to long-term changes). By using models that describe these
relationships (an elaborate ice-sheet model to link air temperature
to ice volume and ice δ¹⁸O storage, and a simple ocean-temperature
model to couple deep-water temperature to surface air temperature) in
conjunction with an inverse methodology, we were able to quantify the
two components for the 3-Myr LR04 stack marine δ¹⁸O record. In the
process, 3-Myr-long mutually consistent records of atmospheric
temperature, ice volume and global sea level were reconstructed.
Using Blackman–Tukey cross-spectral analysis, we estimated the
phasings between these climate records in order to assess causes and
effects during glacial fluctuation, mainly for inception and
termination stages. Even though the Blackman–Tukey method has its
drawbacks in terms of accuracy, we decided to use it to compare our
phasings with those of LR04 (where it was also used). The main
problem with new long records such as those presented here is to find
sufficiently long and independent records for validation. A
comparison between our reconstructed records and existing independent
proxy records is presented in the Supplementary Information.
Inner-core shear-wave anisotropy and texture from
an observation of PKJKP waves 


James Wookey¹ & George Helffrich¹


Since the discovery of the Earth’s core a century ago, and the
subsequent discovery of a solid inner core (postulated to have formed
by the freezing of iron), seismologists have striven to understand
this most remote part of the deep Earth. The most direct evidence for
a solid inner core would be the observation of shear-mode body waves
that traverse it, but these phases are extremely difficult to
observe. Two reported observations in short-period data have proved
controversial. Arguably more successful have been studies of
longer-period data, but such averaging limits the usefulness of the
observations to reported sightings. We present two observations of an
inner-core shear-wave phase at higher frequencies in stacked data
from the Japanese High-Sensitivity Array, Hi-Net. From an analysis of
timing, amplitude and waveform of the ‘PKJKP’ phase we derive
constraints on inner-core compressional-wave velocity and shear
attenuation at about 0.3 Hz which differ from standard isotropic core
models. We can explain waveform features and can partially reconcile
the otherwise large differences between core wavespeed and
attenuation models that our observations apparently suggest if we
invoke shear-wave anisotropy in the inner core. A simple model of an
inner core composed of hexagonal close-packed iron with its c axis
aligned perpendicular to the rotation axis yields anisotropy that is
compatible with both the shear-wave anisotropy that we observe and
the well-established 3 per cent compressional-wave anisotropy.

Our primary observation comes from a single event which occurred on
22 February 2006 in Mozambique, at an epicentral distance of 113.7°
from the centre of Hi-Net (Fig. 1). This was a shallow event with a
moment magnitude Mw = 7.0. For such a large, shallow event the source
time function is remarkably impulsive (see Supplementary Fig. 1).
Phase-weighted stacks (Fig. 2) show very clear PKKPab, pPKKPab, PKiKP
and pPKiKP arrivals, and weak PKKPbc and pPKKPbc. The separation
between the primary arrivals and the depth phases best fits an event
14 km deep. Figure 2e shows the time-slowness window for PKJKP as
predicted by the Earth model ak135 (ref. 9) and a clear arrival is
observed. We are also able to match the arrival with synthetics
(Fig. 3). As this phase arrives very close to the ak135 prediction, we
suggest that this Earth model provides an accurate average inner-core
shear-wave velocity. We also have made a second observation of PKJKP,
from a 2007 event on the mid-Atlantic ridge, but a complex wavetrain
precludes further useful analysis (see Supplementary Figs 7–8).

Figure 1 | Source, ray path and receiver geometry. We searched for
evidence of PKJKP in records at the Japanese Hi-Net array (inset) of
the Mw = 7.0 shallow event (depth ~14 km) on 22 February 2006 in
Mozambique. The epicentral distance to the centre of the array is
113.7°. The right panel shows the ray paths for PKKPab, PKiKP and
PKJKP at this distance (straight lines are P-wave segments, wavy lines
are S-wave).

The PKJKP waveform presented in Fig. 3 shows another interesting
feature: an arrival about 7 s after PKJKP (and 3 s after pPKJKP).
This cannot be sPKJKP or another near-source converted phase, as no
such phase is observed for PKKPab or PKiKP, which have very similar
upper-mantle ray paths (Fig. 2). Furthermore, sPKJKP would leave a
shallow event almost vertically, so the conversion to a P-wave at the
free surface would be extremely weak. We interpret this secondary
phase as the effect of inner-core seismic anisotropy (see
Supplementary Information). Seismic anisotropy is the variation of
seismic wavespeed with direction, which has been observed for the
inner core using P-wave and normal mode data (see, for example,
ref. 12 and references therein). When a shear wave encounters an
anisotropic medium it is split into two components whose
polarizations are defined by the symmetry of the medium, and which
will separate in time as they propagate through the region. In our
scheme, this would lead to two distinct PKJKP phases (which we denote
PKJKP1 and PKJKP2). This has been predicted, but never before
observed. Figure 3 shows the degree of splitting expected for
shear-wave anisotropy between 0 and 2%; to explain the delay we
observe requires an anisotropy of ~1% averaged across the propagation
path through the inner core.
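
The link between a ~7 s delay and ~1% anisotropy can be checked to first order: for a shear leg of length L at mean speed Vs, the fast and slow components separate by roughly (L / Vs) multiplied by the fractional anisotropy. The path length and shear speed below are assumed round numbers chosen only for illustration (a near-diametral path of about 2,400 km and Vs of about 3.6 km/s); they are not values given in the text, and the authors' estimate comes from their own modelling.

```python
def splitting_delay(path_km, vs_km_s, anisotropy):
    """First-order delay (s) between split shear waves: travel time x anisotropy."""
    return path_km / vs_km_s * anisotropy


def required_anisotropy(delay_s, path_km, vs_km_s):
    """Fractional anisotropy needed to produce a given delay."""
    return delay_s * vs_km_s / path_km


# Hypothetical values, NOT from the paper.
ASSUMED_PATH_KM = 2400.0
ASSUMED_VS_KM_S = 3.6

delay = splitting_delay(ASSUMED_PATH_KM, ASSUMED_VS_KM_S, 0.01)
needed = required_anisotropy(7.0, ASSUMED_PATH_KM, ASSUMED_VS_KM_S)
print(f"1% anisotropy -> {delay:.1f} s delay")
print(f"7 s delay     -> {needed:.2%} anisotropy")
```

Under these assumptions a 1% path-averaged anisotropy gives a delay of a little under 7 s, the same order as the observed secondary arrival.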

The simplest model proposed to explain the bulk anisotropy of the 
inner core is the alignment of iron crystals. It has even been
suggested that the inner core might be a single crystal of hexagonal
close-packed (h.c.p.) iron. The anisotropy that we measure is small
compared with those predicted for single-crystal iron at inner-core 
Figure 2 | Seismic data. a, b, Vespagrams for the PKKPab and PKiKP
time–slowness windows, respectively. The colour scale represents the
amount of energy across all the traces at a given time and slowness. These
vespagrams are calculated using a phase-weighted slant stack. Crosshairs
denote predicted times and slowness from ak135 (ref. 9) for various core
phases. Clear maxima associated with PKKPab, pPKKPab, PKiKP and
pPKiKP arrivals are visible, with weaker maxima for PKKPbc and pPKKPbc.
c, Time window for PKKPab in the unstacked data. Because PKKPab is
clearly visible we use it as reference phase to calculate a receiver-side static
time correction, and d shows this correction applied to PKKPab.
conditions (up to ~20%; see Supplementary Fig. 10). To explain our
inferred shear-wave anisotropy as well as the much
better-established P-wave anisotropy, we have explored a range of
simple textural models for these elastic constants. The models are a
series of orientations and rotational averages of h.c.p. and
body-centred cubic (b.c.c.) iron, to provide an estimate of the
aggregate elasticity (and hence anisotropy). These models are
motivated by different ideas of crystal alignment due to dendritic
growth during solidification or post-solidification deformation.
Figure 4 outlines these mechanisms. Models tested are given in
Supplementary Table 1
e, Time-slowness window (relative to the PKKPab reference phase) where
PKJKP is predicted to arrive. A clear maximum can be seen ~1.5 s before the 
prediction, at the correct slowness to within the resolution of the array 
(~0.05 s per degree). There is also energy near the time predicted for 
pPKJKP, although this is low amplitude (near the noise level) and poorly 
constrained in slowness. f, Azimuthal slant stack, at a fixed slowness of 2.6 s 
per degree. This shows that the maximum identified as PKJKP arrives within 
0.2° of the (major arc) great circle path. A second peak is also seen at the 
slowness of PKJKP (denoted by the question mark). 
Figure 3 | Real (top) and synthetic waveforms (bottom). These are
phase-weighted slant stacks at the peak slownesses for PKKPab and
PKJKP. The shaded area is the envelope function of the trace. Given
the uncertainty in relative amplitudes from moment tensor solutions,
in order to match the observed amplitude ratio between PKKPab and
pPKKPab in the real data,
a taper of value 0.25 is applied to the PKKPab synthetics after the primary 
arrival. The same taper is also applied to the PKJKP synthetics. The PKJKP 
phase observed is the Hilbert transform of the reference PKKPab phase, as 
predicted. An arrival is also apparent near the correct time for pPKJKP, 
although this is very close to the noise level of the trace and not well 
constrained. The arrival denoted by the question mark may be due to shear- 
wave splitting by inner-core anisotropy. The scale bar shows the time lag 
predicted for a model of uniform anisotropy, implying that ~1% is required 
to explain this phase. 


and Supplementary Fig. 11. A first-order picture of inner-core
anisotropy shows that P-wave phases on polar raypaths traverse the
region ~3% faster than those on equatorial paths. The only models
tested that even roughly match this constraint involve the rotational
average of h.c.p. Fe with the c axis perpendicular to the Earth’s
rotation axis. These models also show a shear-wave anisotropy that is
roughly compatible with that inferred from our observation of PKJKP.
Thus, the measured anisotropy is currently non-unique with regard to
its cause (that is, depositional or deformational) but does suggest
iron with h.c.p. structure to be a better prospect than iron with
b.c.c. structure. Previous observations of inner-core shear-wave
anisotropy come from inversions of normal-mode observations, which

Figure 4 | Possible causes of crystal alignment in the inner core.
a, Compressional and tensional deformation due to non-uniform growth.
Outer-core convection in Taylor columns leads to larger equatorial
heat flux, promoting freezing at the ICB in these regions. This
oblateness is dynamically unstable, leading to deformation
symmetrically about the rotation axis (Ω). CMB, core–mantle boundary.
b, Crystal alignment due to dendritic solidification. As liquid iron
freezes onto the ICB, dendrite
predict larger absolute anisotropies. The 7-s splitting we observe for
PKJKP is, in fact, compatible with these (see Supplementary 
Information), as such models predict a change in sign of anisotropy 
at a radius between 400 and 800 km (see ref. 20), leading to a relatively 
small aggregate anisotropy along the ray path. Mechanisms to explain 
such variation in anisotropy, however, are more complex than the 
simple textural models proposed above. 

Also of interest is the strength of the PKJKP arrival, which we
measure using the ratio of the recorded amplitudes of PKJKP and PKiKP
(hereafter R_Ji) in linear slant stacks. In our data,
R_Ji = 0.14 ± 0.05, significantly larger than R_Ji = 0.02 predicted
using reflectivity modelling for a model compatible with ak135. This
could be due to either an increased PKJKP amplitude or a reduced
PKiKP amplitude. Inner-core anisotropy might affect PKJKP and PKiKP
amplitudes by altering the reflection and transmission coefficients
at the inner-core boundary (ICB). For example we find, using elastic
constants from ref. 14, that the ratio of PKJKP to PKiKP amplitude
can vary significantly: up to a factor of 25 depending on the degree
of alignment and orientation at the ICB (see Supplementary Fig. 14).
This has also been suggested to explain observations of anomalous
amplitudes of PKPdf phases. As this effect alone explains both
waveform features and the observed R_Ji without requiring
modification of the core’s isotropic properties, we favour it. But to
test whether we can in fact reconcile our observation with an
isotropic, spherically symmetric model of the inner core, we have
also run a suite of synthetics varying the seismic velocity (Vp and
Vs), density and Qs (the shear-wave ‘quality’ factor, also sometimes
called Qμ) in the inner core (see Supplementary Information). We find
that R_Ji depends most strongly on Vp of the inner core at the ICB
(predominantly through the amplitude of PKiKP), with a weaker but
significant dependence on Qs (through the amplitude of PKJKP). The
influence of Vs and density (at least within bounds compatible with
normal-mode data) is negligibly weak. Supplementary Fig. 13 shows the
variation in R_Ji with inner-core Vp and Qs, identifying the
distribution of models consistent with our measured R_Ji. This shows
that to reconcile our measured R_Ji with an isotropic one-dimensional
model of the inner core requires a much higher Qs than is inferred
from normal-mode studies, a considerably reduced Vp (at least near
the ICB), or a more moderate combination of the two. For example, in
order that Qs be compatible with the normal-mode


structures might be formed and could persist deep into the inner
core. In this case crystals would be oriented relative to the dendrite
long axes (d), which we assume to be perpendicular to Ω. Solid-state,
inward axial flow modifies orientation (Supplementary Table 1 and
Fig. 11). c, Alignment due to Maxwell stresses (after ref. 18). In
this model, stresses exerted by the Earth’s magnetic field (B)
re-orient crystals of inner-core iron, leading to large-scale
texturing.


©2008 Macmillan Publishers Limited. All rights reserved


measurements of ref. 22, the P-wave velocity increase at the ICB must
be of the order of 0.3 km s⁻¹ (compared with 0.75 km s⁻¹ in ak135).
A global degree-1 asymmetry of P-wave speed (with a deviation from
the reference model of −0.7 to +0.2%) in the outermost part of the
inner core has been documented, but this variation is much smaller
than our observations require. A strong variation in PKiKP amplitude
has also been observed by using P or PcP as a reference phase. It is
also possible, however, that attenuation values derived from normal
modes are not appropriate at body-wave frequencies (see refs 28, 29
and Supplementary Information). The amplitude spectrum of the PKJKP
(see Supplementary Fig. 9) also suggests that Qs in the inner core
might be higher than previously estimated. This is hard to reconcile
with measurements of P-wave Q from PKIKP without significant bulk
attenuation, and perhaps a layered Q structure in the inner core. The
influence of scattering attenuation may also be important. These
issues are discussed further in the
Supplementary Information, but drawing firm conclusions as to 
the causes will require more observations than provided by a single 
measurement. There are several factors that might influence the 
amplitudes of PKJKP and PKiKP beyond those that may be tested 
by our modelling methodology, including the focusing/defocusing 
effects of ICB topography or inner-core velocity heterogeneity. 
However, we believe that the simplest explanation remains the effect 
of anisotropy. 
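
The size of the amplitude discrepancy discussed above follows from the quoted numbers: a measured PKJKP/PKiKP ratio of 0.14 ± 0.05 against a predicted 0.02. The sketch below only restates that arithmetic and compares it with the factor of up to 25 that the text attributes to anisotropic effects at the ICB; the names are illustrative.

```python
OBSERVED_RATIO = 0.14         # measured PKJKP/PKiKP amplitude ratio
RATIO_ERROR = 0.05            # quoted uncertainty on the measurement
PREDICTED_RATIO = 0.02        # ak135-compatible reflectivity prediction
MAX_ANISOTROPIC_FACTOR = 25   # "up to a factor of 25" quoted in the text


def excess_factor(observed, predicted=PREDICTED_RATIO):
    """How many times larger the observed ratio is than the prediction."""
    return observed / predicted


low = excess_factor(OBSERVED_RATIO - RATIO_ERROR)
central = excess_factor(OBSERVED_RATIO)
high = excess_factor(OBSERVED_RATIO + RATIO_ERROR)

print(f"observed / predicted: {central:.1f} (range {low:.1f} to {high:.1f})")
print("within quoted anisotropic range:", high <= MAX_ANISOTROPIC_FACTOR)
```

The observed ratio exceeds the isotropic prediction by a factor of about 7 (roughly 4.5 to 9.5 within the quoted error), which lies inside the range the text says anisotropy at the ICB could produce.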

To test these ideas further requires more PKJKP observations at 
different traversal directions across the inner core. We believe that as 
the new generation of large-aperture dense array experiments such as 
Hi-Net and the nascent USArray begin to accumulate data, PKJKP 
should become a more routinely detected phase. Such accumulated 
observations will probe other areas of the inner core to assess the 
possible hemispheric variation in its shear anisotropy; our observations
are confined to the quasi-western hemisphere. This will provide
stronger constraints on the depth extent of texturing and inner-core 
growth through time. 


METHODS SUMMARY 


Our data comprise 704 short-period Hi-Net* records from borehole seism- 
ometers sited around the Japanese Islands. We use PKKPab as a reference phase 
to calculate a receiver-side static time correction”. Data are initially bandpass 
filtered between 0.05 and 2 Hz, to remove high-frequency noise. A power-2 
phase-weighted slant stack'! is applied and the data are bandpass filtered again 
between 0.05 and 0.5 Hz to smooth the spectra, and a trace envelope function is 
derived. Figure 2e shows the time-slowness window for PKJKP as predicted by 
ak135 (ref. 9). A clear energy pulse can be observed at a slowness which is 
indistinguishable from the model-predicted slowness, and ~1.5 s earlier than 
the predicted arrival time. Azimuthal stacking shows a deviation of <0.2° from 
the major arc great circle path (Fig. 2f). We interpret this phase to be PKJKP. We 
are also able to match the arrival with reflectivity method"! synthetics for a one- 
dimensional isotropic Earth model’ (Fig. 3). The PKJKP phase observed is the 
Hilbert transform of the reference PKKPab phase, as predicted. Further tests are 
detailed in the Supplementary Information. For amplitude measurements, linear 
slant stacks are used to avoid biasing by phase weighting. Amplitude error bounds 
are estimated from the signal-to-noise ratio of the PKJKP peak; those of PKiKP are 
negligible in comparison. A correction is also made for the moment tensor. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Our data comprise 704 short-period Hi-Net* records from borehole seism- 
ometers sited around the Japanese Islands. As the PKKPab arrival is very clear, 
we use it as a reference phase to calculate a receiver-side static time correction*’. 
Data are initially bandpass filtered between 0.05 and 2 Hz, to remove high- 
frequency noise. A power-2 phase-weighted slant stack" is applied and the data 
are bandpass filtered again between 0.05 and 0.5 Hz to smooth the spectra. We 
assume a linear moveout between PKJKP and PKKPab in slant stacking. The 
maximum deviation from this is of the order of 0.1 s, much less than the dominant 
period of the phase, and may be neglected. Finally the trace envelope 
function is derived. Figure 2e shows the time-slowness window for PKJKP as 
predicted by ak135 (ref. 9). A clear energy pulse can be observed at a slowness 
which is indistinguishable from the model-predicted slowness (we calculate the 
slowness resolution of the array to be about 0.05 s per degree; see Supplementary 
Fig. 2), and ~1.5 s earlier than the predicted arrival time. We interpret this phase 
to be PKJKP. Other phases that have previously been identified as problematic 
(for example, PcPPKIKP, PcPPKiKP and PKKPdf; see refs 4, 5, 7) are predicted 
to arrive well separated in time or slowness or both. Further evidence for the 
arrival being PKJKP comes from slant stacking with varying azimuth, with a 
fixed slowness of 2.6 s per degree relative to PKKPab (Fig. 2f). This shows that the 
arrival has maximum amplitude at a deviation of less than 0.2° from the great 
circle path and that it arrives on the major arc. We are also able to match the 
arrival with synthetics. We use reflectivity method"! synthetics for a one-dimen- 
sional isotropic Earth model’. These are processed identically to real data, except 
a static correction is unnecessary. Figure 3 shows the comparison of the slant- 
stacked waveform with synthetics. The PKJKP phase observed is the Hilbert 
transform of the reference PKKPab phase, as is predicted by the synthetics. A 
weak phase is also observed at the correct time for a source 14 km deep as 
predicted by PKKPab and PKiKP. Further tests are detailed in the 
Supplementary Information. To compare the shear-wave splitting we infer with 
previous, normal-mode derived models we calculate** accrued lag-time between 
SH and SV phases, assuming no perturbation of the isotropic ray path. This is 
done for the minimum, maximum and mean models of ref. 20. We find that the 
7-s splitting we measure is compatible with the range of normal-mode models 
(see Supplementary Information). For amplitude measurements, linear slant 
stacks are used to avoid biasing by phase weighting. Amplitude error bounds 
are estimated from the signal-to-noise ratio of the PKJKP peak; those of PKiKP 
are an order of magnitude smaller because of a much larger amplitude. A cor- 
rection is also made for the moment tensor. 
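
The linear slant stack used for the amplitude measurements can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the station offsets, sampling interval, Gaussian test pulse and trial slowness grid are hypothetical, moveout is rounded to whole samples, and none of the filtering, static corrections or phase weighting described above is reproduced.

```python
import math


def make_synthetic_traces(offsets_deg, slowness, dt, n_samples=200, centre=100, width=3.0):
    """Hypothetical test data: one Gaussian pulse per station, delayed by slowness * offset."""
    traces = []
    for offset in offsets_deg:
        arrival = centre + round(slowness * offset / dt)  # arrival sample at this station
        traces.append([math.exp(-0.5 * ((i - arrival) / width) ** 2) for i in range(n_samples)])
    return traces


def linear_slant_stack(traces, offsets_deg, slowness, dt):
    """Delay-and-sum: undo a linear moveout of slowness * offset, then average the traces."""
    n_samples = len(traces[0])
    stack = [0.0] * n_samples
    for trace, offset in zip(traces, offsets_deg):
        shift = round(slowness * offset / dt)  # moveout in whole samples
        for i in range(n_samples):
            j = i + shift
            if 0 <= j < n_samples:
                stack[i] += trace[j]
    return [value / len(traces) for value in stack]


def best_slowness(traces, offsets_deg, trial_slownesses, dt):
    """Return the trial slowness whose stack has the largest peak amplitude."""
    return max(trial_slownesses,
               key=lambda p: max(linear_slant_stack(traces, offsets_deg, p, dt)))


if __name__ == "__main__":
    offsets = [-1.0, -0.5, 0.0, 0.5, 1.0]    # degrees from the array centre (assumed)
    dt = 0.1                                  # sampling interval in seconds (assumed)
    traces = make_synthetic_traces(offsets, slowness=2.6, dt=dt)
    trials = [k / 10 for k in range(0, 51)]   # 0 to 5 s per degree
    print("best-fitting slowness (s per degree):", best_slowness(traces, offsets, trials, dt))
```

Because the stack is a plain average, its peak amplitude is not distorted by coherence weighting, which is the reason given above for preferring it when amplitudes are measured.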


34. Thomsen, L. Weak elastic anisotropy. Geophysics 51, 1954-1966 (1986). 
Inapparent infections and cholera dynamics 


Aaron A. King, Edward L. Ionides, Mercedes Pascual & Menno J. Bouma 


In many infectious diseases, an unknown fraction of infections 
produce symptoms mild enough to go unrecorded, a fact that 
can seriously compromise the interpretation of epidemiological 
records. This is true for cholera, a pandemic bacterial disease, 
where estimates of the ratio of asymptomatic to symptomatic 
infections have ranged from 3 to 100 (refs 1-5). In the absence 
of direct evidence, understanding of fundamental aspects of chol- 
era transmission, immunology and control has been based on 
assumptions about this ratio and about the immunological con- 
sequences of inapparent infections. Here we show that a model 
incorporating high asymptomatic ratio and rapidly waning 
immunity, with infection both from human and environmental 
sources, explains 50 yr of mortality data from 26 districts of 
Bengal, the pathogen’s endemic home. We find that the asympto- 
matic ratio in cholera is far higher than had been previously sup- 
posed and that the immunity derived from mild infections wanes 
much more rapidly than earlier analyses have indicated. We find, 
too, that the environmental reservoir*® (free-living pathogen) is 
directly responsible for relatively few infections but that it may be 
critical to the disease’s endemicity. Our results demonstrate that 
inapparent infections can hold the key to interpreting the patterns 
of disease outbreaks. New statistical methods’, which allow rig- 
orous maximum likelihood inference based on dynamical models 
incorporating multiple sources and outcomes of infection, season- 
ality, process noise, hidden variables and measurement error, 
make it possible to test more precise hypotheses and obtain unex- 
pected results. Our experience suggests that the confrontation of 
time-series data with mechanistic models is likely to revise our 
understanding of the ecology of many infectious diseases. 

Cholera is a diarrhoeal disease caused by enteric infection with the 
bacterium Vibrio cholerae. Six of the seven cholera pandemics that 
have swept the globe since 1817 originated in the low-lying, densely 
populated regions north of the Bay of Bengal, where the disease is 
endemic. Although much attention has been focused on cholera’, 
unsolved puzzles remain about its mode of transmission and the role 
of host immunity in its dynamics. This is largely because, in regions 
where cholera is endemic, most cholera cases are mild or asympto- 
matic but the true extent of asymptomatic infection has been difficult 
to assess. Estimates of the ratio of asymptomatic to symptomatic 
cases vary greatly, and the importance of inapparent infections in 
the dynamics of cholera outbreaks is unknown. To determine what 
role is played by inapparent infections, we used an approach that 
allows indirect inference about unobserved variables. 

A remarkably rich data set on the pattern of cholera epidemics 
exists in the form of mortality records kept by the sanitary commis- 
sioners of the former British East Indian province of Bengal’. The 
data consist of monthly cholera death counts in each of 26 districts 
over the period 1891-1940 (Supplementary Fig. 1). To analyse these 
data, we formulated a series of models incorporating known 
or hypothesized mechanisms of transmission and immunity. A parsimonious 
model for cholera dynamics is of susceptible–infectious–recovered–susceptible 
(SIRS) form (Fig. 1a). A novel feature of this model is that it incorporates 
both transmission tied to 
human prevalence (using a traditional mass-action term) and trans- 
mission from an environmental reservoir (where the pathogen is 
commonly living in aquatic environments)**'*"'?. This model is a 
Figure 1 | The mechanistic models used. a, SIRS model; b, two-path model; 
c, environmental-phage model. Births, related to the total population size H, 
are assumed to feed the pool of susceptibles, S. Individuals are susceptible to 
infection when born. Exposure to the pathogen occurs at time-dependent 
rate λ(t). c is the probability that an exposure leads to a contagious infection 
(class I). Note that when c = 1 and p = , the two-path model (b) reduces to 
the SIRS model (a); when c < 1, some exposures result in short-term 
immunity (class Y). Infected individuals die at an excess rate m and recover 
at a rate γ; the time an individual spends within the I class is exponentially 
distributed. We assume that an individual remains immune to reinfection 
for a duration gamma-distributed with mean 1/ε and variance 1/(kε²). Once 
immunity has waned, an individual re-enters the susceptible pool (S). The 
measured variable is monthly deaths, M. The mean duration of short-term 
immunity is 1/ρ. Individuals in each class are subject to constant 
background mortality at rate 0.02 yr⁻¹. The force of infection, λ(t), includes 
terms for environmental and human sources of infection and is assumed to 
vary seasonally. Because the seasonality of cholera dynamics in Bengal is 
complex, we used a semi-mechanistic approach: transmission was modelled 
by a flexible periodic function of time. In the environmental-phage model 
(c), as infected hosts shed pathogen, phage @ builds up in the environment 
and reduces transmissibility. The equations specifying these models are 
given in the Supplementary Equations. 
partially observed, nonlinear, stochastic dynamical system in continuous 
time and as such is not amenable to analysis by standard 
statistical techniques. Inference for such systems has recently been 
facilitated by a new likelihood maximization procedure’ which 
allows for measurement error, non-stationarity, irregular sampling 
intervals and the mechanistic inclusion of covariates (see Methods). 
We fitted the SIRS model to each district’s data, assessed its explanatory 
power using log likelihood and the Akaike information criterion (AICc) and 
compared it with a semi-mechanistic time-series model for cholera and two 
non-mechanistic time-series models. 
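
As an illustration of how such a comparison works, the sketch below computes a small-sample corrected AIC and the difference between two fitted models. It assumes that the subscripted AIC in the text is the usual corrected form, AICc = −2 log L + 2k + 2k(k + 1)/(n − k − 1); the log likelihoods and parameter counts are hypothetical and are not taken from the Supplementary Tables.

```python
def aicc(log_likelihood, n_params, n_obs):
    """Akaike information criterion with the usual small-sample correction."""
    if n_obs - n_params - 1 <= 0:
        raise ValueError("AICc needs n_obs > n_params + 1")
    aic = -2.0 * log_likelihood + 2.0 * n_params
    return aic + 2.0 * n_params * (n_params + 1) / (n_obs - n_params - 1)


def delta_aicc(candidates, n_obs):
    """Map each model name to its AICc minus the smallest AICc in the set."""
    scores = {name: aicc(loglik, k, n_obs) for name, (loglik, k) in candidates.items()}
    best = min(scores.values())
    return {name: score - best for name, score in scores.items()}


if __name__ == "__main__":
    # Hypothetical log likelihoods and parameter counts for one district.
    candidates = {"SIRS": (-3750.0, 13), "time-series model": (-3790.0, 10)}
    n_obs = 50 * 12  # 50 years of monthly death counts
    for name, delta in delta_aicc(candidates, n_obs).items():
        print(f"{name}: delta AICc = {delta:.1f}")
```

The model with the smallest AICc scores zero; larger differences indicate weaker support after penalizing for the number of fitted parameters.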

The SIRS model explained the data dramatically better than the 
previous best fit (ΔAICc > 58; Supplementary Table 1). Parameter 
estimates reveal some surprises. A prediction of the model, robustly 
consistent across districts, is that immunity must wane on a timescale 
of weeks to months (Fig. 2 and Supplementary Table 3). This is in 
stark contrast to the widely held belief that infection-derived immun- 
ity to cholera wanes on a timescale of 3-10 yr (refs 13-17). The model 
also robustly predicts low case fatality (0.004 ± 0.002, mean ± s.d. 
across 26 districts). Because fatality among hospitalized cases was 
historically in excess of 50%’*, this implies a very large asymptomatic 
ratio. The implication is that most exposures result not in severe 
cholera, but in mild or asymptomatic infection, the immunological 
memory of which is short-lived. This prediction of a high asympto- 
matic ratio is consistent with the only intensive field studies of inap- 
parent infections of which we are aware**. However, because in this 
model all infected individuals are equally infectious, the prediction is 
that the vast majority of infectives are ‘silent shedders’—infectious 
but without symptoms. The evidence for such a conclusion is mixed 
but weak at best”. Moreover, because in this model all infections are 
alike, it is unclear whether the improved dynamical description 
afforded by the SIRS model over the previous best fit is due to the 
higher proportion of asymptomatic infections or to the shorter dura- 
tion of immunity. 

To tease apart the immunological consequences of exposure to the 
pathogen from the degree of infectiousness induced, and to allow for 
heterogeneity in infectiousness, we formulated a second mechanistic 
model in which exposed individuals can follow an alternate pathway, 
deriving immunity to infection while shedding a negligible quantity 
of vibrio and suffering a negligible disease-induced mortality 
(Fig. 1b). In essence, this model allows for the possibility that expo- 
sure to the pathogen effectively vaccinates against reinfection. We 
leave it to the data to determine how frequently such natural vac- 
cination occurs and how long the resulting immunity lasts. 

Under the two-path model, the timescale of short-term immunity 
(9.9 ± 4.7 weeks, mean ± s.d. across 26 districts) agrees with that of 
the SIRS model. The estimated case fatality under this model is 
0.34 ± 0.21 (consistent with conditions during this period) and 
survivors of severe infections receive longer-lasting immunity 


Figure 2 | Profile likelihood of the duration of immunity, 1/ε, in the SIRS 
model for the data from Dacca district. Dacca (now spelled Dhaka) is the 
only district for which comparable analyses have been performed. To 
compute the profile likelihood, ε is fixed and the likelihood is maximized 
over the remaining parameters. The vertical dashed lines embrace the 
approximate 95% confidence interval (2-12 weeks). 
(1.5 ± 0.7 yr), consistent with experimental evidence. 


Although in some districts the evidence strongly favours the two- 
path model, the data taken as a whole are equivocal on the subject of 
which model is better (see Supplementary Table 2). The implication 
is that it is the short-term immunity and high asymptomatic ratio 
predicted by both models, and not a large fraction of silent shedders, 
that accounts for the dramatically improved description afforded by 
our models. 

A second robust prediction of both models is that, in typical epi- 
demics, relatively few cases are due to the environmental reservoir. 
More precisely, though our models cannot speak to the exact route of 
transmission (that is, food-borne, water-borne, fomites, etc.), our 
results clearly indicate the relative importance of the positive feed- 
back associated with the human-associated source of infection over 
the human-independent environmental source of infection. 
Nevertheless, the seasonally averaged basic reproductive number, 
R₀, for human-associated transmission is estimated to be quite low 
(SIRS model, 1.5 ± 0.2; two-path model, 1.5 ± 0.2). This contrasts 
sharply with the much higher values of R₀ recently proposed and 
suggests, paradoxically, that for V. cholerae, in this the region where 
cholera is most persistent and outbreaks most frequent, humans are 
potentially a marginal habitat: the persistence of the disease may be 
largely due to the environmental reservoir which provides a small 
extrinsic force of infection. The strength of this environmental force 
of infection varies geographically in a pattern (Supplementary Fig. 2) 
that mirrors the earlier suggestion that the coastal regions of 
Bangladesh are the native habitat of classical cholera’'. Both the 
SIRS and two-path models make the simplifying assumption that 
the force of infection due to the environmental reservoir has neg- 
ligible seasonal fluctuation. To relax this assumption”, we fit a third 
model which extends the SIRS model but allows for seasonality in the 
environmental reservoir. This model explains the data significantly 
better in many districts, but our conclusions about low R₀, rapid loss 
of immunity and high prevalence of inapparent infection remain 
unchanged (see Supplementary Information). 

A central challenge in cholera epidemiology is to explain how the 
outbreaks can be so explosive at the outset and yet self-limiting to the 
point that sizable epidemics can recur twice annually. Our explana- 
tion contrasts sharply with those proposed earlier'*’’. Simulations of 
the various models (Fig. 3) make plain the contrasts between the 
explanations. In the previous view, infection-derived immunity 
tends to wane over the scale of years, the explosiveness of outbreaks 
is controlled by spatial effects, and it is the seasonal drop in transmis- 
sibility (presumably related to environmental drivers) that stops the 
epidemics. Because in the previous view the ratio of deaths to infec- 
tions is comparatively modest, the susceptible population is depleted 
only slightly by each epidemic and is continually replenished by 
births and the waning of immunity. In the new view, epidemics are 
again triggered by rising seasonal transmissibility and the presence of 
a reservoir, but the high asymptomatic ratio means that many more 
individuals are exposed. The vast majority of these receive short-term 
protection from infection so that depletion of the susceptible pool 
brings the epidemic to a halt. As immunity wanes, on the timescale of 
weeks to months, the susceptible pool is replenished, setting the stage 
for the next outbreak. In the new view, severe cases play little role in 
shaping the dynamics. 

It has been proposed that the seasonality of cholera in this region 
may be due to the interaction of free-living vibrio with lytic bacterio- 
phage in surface waters’**°. According to this hypothesis, as an epi- 
demic progresses, the density of phage in the environment increases 
and survivorship of vibrio consequently declines, which in turn leads 
to attenuation of transmissibility. To determine whether such an 
effect might explain the data, we formulated a fourth model, which 
extends the SIRS model by allowing for phage to build up in the 
environment (as infected individuals shed pathogen into surface 
waters) and ultimately interrupt transmission (Fig. 1c and 
Supplementary Equations). Using the same methods, we estimated 
the strength and timescale of this effect; the results are presented in 
the Supplementary Information. In brief, the main conclusions 
described above are robust to the inclusion of the environmental 
phage effect: the estimated duration of immunity remains short 
(9.3 ± 8.3 weeks), fatality low (0.004 ± 0.002, compare the corresponding 
value for the SIRS model) and R₀ small (1.6 ± 0.3). 
Overall, the historical mortality data are not strong evidence in 
favour of the environmental-phage hypothesis. Furthermore, a crit- 
ical implication of the environmental-phage hypothesis is that the 
phage cycles must lag behind cholera cycles, but not too far: the initial 
drop in cholera cases is predicted to be roughly coincident with high 
levels of ambient phage. In the one instance where concurrent data on 
cholera cases and ambient phage densities are available, however, the 
phage appear to lag approximately 180 degrees behind cholera 
cases™*. Thus cholera cases begin to fall while phage concentrations 
remain small, and cholera cases begin to climb again while phage 
densities remain high. This is further, independent, evidence against 
the notion that phage—vibrio interaction in the environment is 
responsible for cholera seasonality. 

The models presented here make several testable predictions. In 
particular, our results suggest that population surveys using sensitive 
Figure 3 | Typical model simulations versus data for the district of Dacca. 
a, Simulated susceptible fraction (S/H) from our SIRS model with seasonal 
reservoir (blue) and from the time-series SIRS (TSIRS) model of Koelle et 
al. (red). The estimated susceptible fraction of the population is much 
lower under the TSIRS model, and grows with time. By contrast, under the 
models introduced here, rapid waning of immunity allows the susceptible 
fraction to return seasonally to high levels. The predicted high prevalence of 
inapparent infection implies that the susceptible pool is rapidly depleted 
during an outbreak. b, Simulated monthly deaths from the TSIRS model. 
Despite the low susceptible fraction and that model’s non-mass-action 
transmission assumption, the relatively low prevalence of inapparent 
infection leads to overly explosive outbreaks and overly deep crashes. 

c, Simulated monthly deaths from the SIRS model with seasonal reservoir. 
d, Cholera death data from Dacca district. Parameter estimates for the 
seasonal-reservoir SIRS model are reported in the Supplementary 
Information; parameters for the TSIRS model are from K. Koelle (personal 
communication). The semi-parametric regression used to fit the TSIRS 
model is initialized using the first 11 yr of data: no simulated data are 
therefore generated over that interval. 
techniques to detect the presence of V. cholerae, cholera-specific bacteriophage 
and/or coproantibodies in stools of healthy and mildly 
symptomatic individuals should reveal a much higher prevalence of 
inapparent infection than is currently understood to be the case. Such 
inapparent infections are predicted to be associated with reduced sus- 
ceptibility to infection. Studies that centre on cholera patients, on the 
other hand, because they focus on the tail of the distribution of disease 
severity, can be expected to yield severely biased results when extra- 
polated to the population level. Further implications of our results for 
public health are discussed in the Supplementary Information. 

In all models of epidemiological dynamics, critical quantities 
include the fraction of individuals susceptible to infection, immune 
from infection and asymptomatically infected. Yet it is almost never 
the case that available data include direct measurements of these 
quantities. Moreover, it is frequently true that many asympto- 
matic—or merely unrecorded—infections exist for each case that is 
recorded. Our results demonstrate that conclusions about the bio- 
logical mechanisms underlying a pattern of disease outbreaks can 
depend sensitively on the prevalence of inapparent infections. New 
modelling approaches that allow indirect inference in the face of 
hidden variables and incomplete information are therefore likely to 
revise our understanding of the ecology of many infectious diseases. 


METHODS SUMMARY 


All models were formulated as stochastic differential equations, which were 
integrated using the Euler-Maruyama algorithm. Details of the equations and 
simulation methods are given in the Methods. The iterated filtering algorithm’ 
was used to fit the models to the data. This algorithm is described in detail in the 
Supplementary Methods and is implemented as part of an open-source package, 
pomp, within the R statistical computing environment”. All codes will be made 
available by the authors on request. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Inference for dynamical systems. A stochastic dynamical system is generated by 
a suitable function $f(x, s, t, \theta, W)$. Here, $W$ is a stochastic quantity that is drawn 
independently each time the function is evaluated and $\theta$ is a vector of parameters. 
Thus, for a sequence of times $t_0 \le t_1 \le \dots \le t_N$, the recursion 

$$X(t_n) = f\bigl(X(t_{n-1}), t_{n-1}, t_n, \theta, W\bigr),$$

together with the initial value $X(t_0)$, specifies a Markov process. The unspecified 
nature of $f(\cdot)$ reflects the idea that the evolution of the system could correspond 
to an essentially arbitrary stochastic simulation algorithm. In the context of this 
paper, $X(t)$ is a vector of counts in each disease class and $f(\cdot)$ corresponds to a 
(numerical) solution of any of the stochastic differential equation systems in the 
Supplementary Equations. The data consist of observations $y_1, \dots, y_N$ at times 
$t_1, \dots, t_N$. We suppose that, conditional on $\{X(t_k) = x_k\}_{k=1}^{N}$, $y_n$ is drawn from a 
density $g(y_n \mid x_n, t_n, \theta)$. 

In the Supplementary Methods, we present a pseudocode description of an 
iterated filtering methodology (maximization by iterated filtering, MIF) which 
has recently been shown to enable likelihood-based inference for this frame- 
work’. This procedure allows inference for rather general partially observed, 
stochastic, multivariate, continuous-time dynamical systems. The MIF algo- 
rithm also accommodates measurement error, non-stationarity, irregular obser- 
vation times and the mechanistic inclusion of covariates. It has been 
implemented in the R statistical language’® as part of the package pomp. Note 
that the algorithmic parameters play a role in the computational efficiency of the 
method but do not affect the scientific conclusions once likelihood maximiza- 
tion has been confirmed by diagnostic plots’. The package and all other codes 
will be made available by the first author on request. 
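
The basic building block of filtering-based likelihood methods can be sketched with a bootstrap particle filter. The example below is not the MIF algorithm itself and does not use pomp: it only estimates the log likelihood of a hypothetical toy model (a first-order autoregressive latent state observed with Gaussian error) at a fixed parameter value. The model, its parameter `a`, the observation error `obs_sd` and the particle count are all assumptions made for illustration.

```python
import math
import random


def normal_pdf(y, mean, sd):
    """Gaussian measurement density g(y | x)."""
    z = (y - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def simulate_data(a, n_obs, rng, obs_sd=0.5):
    """Toy latent process X_n = a * X_{n-1} + W_n, observed with Gaussian error."""
    x, data = 0.0, []
    for _ in range(n_obs):
        x = a * x + rng.gauss(0.0, 1.0)
        data.append(x + rng.gauss(0.0, obs_sd))
    return data


def particle_filter_loglik(data, a, n_particles, rng, obs_sd=0.5):
    """Bootstrap particle filter estimate of the log likelihood at parameter a."""
    particles = [0.0] * n_particles
    loglik = 0.0
    for y in data:
        # Propagate every particle with an independent draw of the process noise.
        particles = [a * x + rng.gauss(0.0, 1.0) for x in particles]
        # Weight each particle by the measurement density of the new observation.
        weights = [normal_pdf(y, x, obs_sd) for x in particles]
        mean_weight = sum(weights) / n_particles
        if mean_weight <= 0.0:
            return float("-inf")  # all particles are incompatible with the data
        loglik += math.log(mean_weight)
        # Resample in proportion to the weights before the next time step.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return loglik


if __name__ == "__main__":
    rng = random.Random(1)
    data = simulate_data(a=0.9, n_obs=50, rng=rng)
    for trial_a in (0.9, 0.0, -0.9):
        print(trial_a, particle_filter_loglik(data, trial_a, 500, rng))
```

Only the ability to simulate the process and evaluate the measurement density is needed, which is why this style of inference suits models specified by an arbitrary stochastic simulator.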

Fitting the Bengal data. Our statistical inference technique is based on simu- 
lation of the stochastic dynamical system. To simulate the stochastic dynamical 
systems, we used the Euler—-Maruyama numerical scheme” with a timestep of 
0.05 months. We fit the parameters of each model to the data from each district 
using the algorithm given in the Supplementary Methods. We used J = 10,000 
particles, a fixed lag L = 60 and M = 80 iterations with a cooling factor, a = 0.95. 
Using the same algorithm, we then constructed profile likelihoods against the 
parameters $\varepsilon$ and $c$. This analysis revealed the existence of weak non-identifiabilities 
among the parameters $c$, $\gamma$, $m$, $\varepsilon$ and $\rho$. Despite this, certain combinations 
of parameters, such as $R_0 = c\langle\beta_{\mathrm{seas}}(t)\rangle/(\gamma + \delta + m)$ and the case fatality 
$m/(\gamma + \delta + m)$, are well-identified. Moreover, although estimates of the durations of 
short-term immunity vary considerably among districts, it is the case that, for 
all districts, the duration of immunity is predicted to be far shorter than that 
derived from severe infections induced experimentally'*”. Construction of the 
profile likelihoods also facilitated the location of the global maximum likelihood 
estimates, which, after further refinement using the algorithm above with 30,000 
particles, are reported in the Supplementary Tables. 
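
To make the simulation step concrete, the following sketch applies an Euler–Maruyama update with a timestep of 0.05 months to a simplified stochastic SIRS model. It is not the authors' system, whose equations are in the Supplementary Equations: the noise term, the absence of births and deaths, the clipping of the infection flux and every parameter value are assumptions chosen only to give a self-contained example.

```python
import math
import random


def simulate_sirs(beta, omega, gamma, epsilon, sigma, dt, n_steps, rng,
                  state=(0.9, 0.01, 0.09)):
    """Euler-Maruyama path of (S, I, R) fractions for a toy stochastic SIRS model.

    Rates are per month. beta: human-to-human transmission; omega: constant
    environmental force of infection; gamma: recovery; epsilon: loss of
    immunity; sigma: intensity of multiplicative noise on transmission.
    """
    s, i, r = state
    path = [(s, i, r)]
    for _ in range(n_steps):
        force = beta * i + omega                     # human plus environmental sources
        dw = rng.gauss(0.0, math.sqrt(dt))           # Brownian increment over one timestep
        infections = force * s * (dt + sigma * dw)   # drift plus noise
        infections = min(max(infections, 0.0), s)    # clip so no class goes negative
        recoveries = gamma * i * dt
        waning = epsilon * r * dt
        s, i, r = (s - infections + waning,
                   i + infections - recoveries,
                   r + recoveries - waning)
        path.append((s, i, r))
    return path


if __name__ == "__main__":
    path = simulate_sirs(beta=6.0, omega=0.01, gamma=4.0, epsilon=0.4, sigma=0.1,
                         dt=0.05, n_steps=20 * 12 * 10, rng=random.Random(0))
    s, i, r = path[-1]
    print(f"after 10 years: S={s:.3f} I={i:.4f} R={r:.3f}")
```

Each flux is subtracted from one class and added to another, so the three fractions always sum to their initial total; the clipping is a pragmatic departure from the pure Euler–Maruyama scheme.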

Review of likelihood-based inference. We give a review of likelihood-based 
inference in the context of this paper. This review focuses on practical considerations; 
we refer the reader to Casella & Berger or Rice for a complete introduction, 
and to Barndorff-Nielsen & Cox for a more advanced treatment. For a 
discussion of the role of likelihood, profile likelihood ratio tests applied to 
stochastic models for biological systems, we direct the reader to Hilborn & 
Mangel. Reasons to use likelihood as a basis for inference include statistical 
efficiency (that is, making the strongest conclusions available from limited data), 
objectivity (that is, providing a single criterion applicable to a wide range of 
models and data) and the availability of useful techniques for comparing rival 
hypotheses. 

For time-series data, written as $x_{1:T} = x_1, \dots, x_T$, a stochastic model consists of 
a density function $f(x_{1:T} \mid \theta)$ depending on a vector $\theta$ of unknown parameters. The 
likelihood function arises from considering the density as a function of $\theta$, treating 
the data as fixed. Thus the log likelihood is defined to be 

$$\log \mathcal{L}(\theta) = \log f(x_{1:T} \mid \theta),$$


where log is the natural logarithm. Rival hypotheses, labelled $H_0$ and $H_1$, concerning 
the appropriate structure for a stochastic model can be formalized in 
terms of whether $\theta$ takes a value in $\Theta_0$ or $\Theta_1$ respectively, where $\Theta_0$ and $\Theta_1$ are 
subsets of the set $\Theta$ of all possible parameter vectors. We refer to $H_0$ and $H_1$ as 
hypotheses or models interchangeably. 

Comparison of nested models. Suppose that $H_0$ and $H_1$ are nested, meaning that 
$\Theta_0$ is a subset of $\Theta_1$. In practical terms, this is taken to mean that $\Theta_0$ is described 
by $d_0$ freely varying parameters, whereas $\Theta_1$ is described by these $d_0$ parameters 
together with a number $d_1 - d_0$ of additional parameters. Thus $\Theta_0$ can be written 
in terms of a vector of length $d_0$ and $\Theta_1$ in terms of a vector of length $d_1$. The 
maximum likelihood parameter estimate for model $H_0$ is denoted by $\hat\theta_0$ and 
corresponds to the parameter value giving rise to the largest value of $\log\mathcal{L}(\theta)$ 
among all values of $\theta$ in $\Theta_0$. A corresponding definition applies to $\hat\theta_1$. Because $\Theta_1$ 
includes $\Theta_0$, it follows that $\log\mathcal{L}(\hat\theta_1) \ge \log\mathcal{L}(\hat\theta_0)$. Under standard regularity 
conditions, twice the improvement in the log likelihood that occurs when moving 
from the simpler model $H_0$ to the more complex model $H_1$ can be compared 
with a $\chi^2$ random variable on $d_1 - d_0$ degrees of freedom. Thus, Supplementary 
Table 1 presents a P-value of the form 

$$P = \mathrm{Prob}\bigl[\chi^2_{d_1 - d_0} > 2\{\log\mathcal{L}(\hat\theta_1) - \log\mathcal{L}(\hat\theta_0)\}\bigr]. \qquad (1)$$


The P-value given by equation (1) is an approximation. More precise P-values 
can in principle be obtained by simulation, but this $\chi^2$ approximation is a widely 
used technique that has been found to be reliable for many practical purposes. 
It is both conceptually and operationally simpler to calculate initially the standard 
$\chi^2$ approximation in equation (1); if this gives rise to an unequivocal answer, 
there is little reason to present a simulation study as additional evidence. 
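
The P-value in equation (1) can be evaluated with a few lines of Python. The sketch below computes the upper tail of a chi-squared distribution with an integer number of degrees of freedom from a standard recurrence, then applies it to a pair of maximized log likelihoods. The function names and the log-likelihood values in the demonstration are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper tail probability Prob[chi-squared(df) > x] for a positive integer df."""
    if df < 1 or int(df) != df:
        raise ValueError("df must be a positive integer")
    if x <= 0.0:
        return 1.0
    z = x / 2.0
    # Closed forms for the two starting cases: df = 1 (a = 1/2) and df = 2 (a = 1).
    if df % 2 == 1:
        a, tail = 0.5, math.erfc(math.sqrt(z))
    else:
        a, tail = 1.0, math.exp(-z)
    # Recurrence Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a + 1), with a = df / 2.
    while 2.0 * a < df:
        tail += z ** a * math.exp(-z) / math.gamma(a + 1.0)
        a += 1.0
    return tail


def lrt_p_value(loglik_simple, loglik_complex, extra_params):
    """P-value of equation (1): twice the log-likelihood gain against chi-squared."""
    statistic = 2.0 * (loglik_complex - loglik_simple)
    return chi2_sf(statistic, extra_params)


if __name__ == "__main__":
    # Hypothetical maximized log likelihoods; three extra parameters in the larger model.
    print(lrt_p_value(loglik_simple=-3755.2, loglik_complex=-3750.0, extra_params=3))
```

A gain of 5.2 log-likelihood units on three degrees of freedom gives a statistic of 10.4, which lies beyond the 95% point of 7.81, so the larger model would be preferred at the 5% level in this made-up case.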

Modifications to equation (1) are required when the models under consideration 
are nested in a non-standard way. This arises in the context of the 
analysis presented in Supplementary Table 1, owing to the non-standard nesting 
of the SIRS model within the two-path model. Specifically, the two-path model 
reduces to the SIRS model when $c = 1$, at which point $\rho$ is unidentified. 
Anisimova et al. investigated similar situations and demonstrated that the 
chi-squared approximation to the likelihood ratio test is conservative in such 
cases. In the caption of Supplementary Table 1, we report the P-value associated 
with such a test on three degrees of freedom: $\rho$, $c$ and the initial condition $Y_0$. 

Non-nested models may still be compared by their likelihood, for example by 
the use of Akaike’s information criterion, but formal testing of hypotheses 
becomes more awkward. Thus, in Supplementary Table 1, we contrast the best-fitting 
SIRS model with the nested SIRS model constrained to match the duration 
of immunity found by Koelle & Pascual. Another consideration motivating 
this comparison is that the nested SIRS model has a similar likelihood to the 
model of Koelle & Pascual, while having fewer parameters to estimate. 
Additionally, this approach avoids the issue that the comparison with previously 
published models might be unfair because the procedure used to fit the models in 
this paper may be more effective than that used by previous research. 

Profile likelihood. The profile likelihood is a description of the weight of evidence 
about the value of a single component of the vector of unknown parameters. 
Setting $\theta = (\theta_1, \dots, \theta_d)$, the log likelihood can be written as $\log\mathcal{L}(\theta_1, \dots, \theta_d)$ and 
the profile log likelihood of $\theta_1$ is 

$$\log\mathcal{L}_{\mathrm{profile}}(\theta_1) = \max_{\theta_2, \dots, \theta_d} \log\mathcal{L}(\theta_1, \dots, \theta_d).$$

From equation (1), a 95% confidence region for $\theta_1$ is given by the range of 
values of $\theta_1$ for which 

$$2\bigl\{\log\mathcal{L}(\hat\theta) - \log\mathcal{L}_{\mathrm{profile}}(\theta_1)\bigr\} < c,$$

where $c$ is defined by $\mathrm{Prob}[\chi^2_1 < c] = 0.95$. This is how the confidence interval in 
Fig. 2 was constructed. 
Inapparent infections and cholera dynamics 


Aaron A. King, Edward L. Ionides, Mercedes Pascual & Menno J. Bouma 


In many infectious diseases, an unknown fraction of infections 
produce symptoms mild enough to go unrecorded, a fact that 
can seriously compromise the interpretation of epidemiological 
records. This is true for cholera, a pandemic bacterial disease, 
where estimates of the ratio of asymptomatic to symptomatic 
infections have ranged from 3 to 100 (refs 1-5). In the absence 
of direct evidence, understanding of fundamental aspects of chol- 
era transmission, immunology and control has been based on 
assumptions about this ratio and about the immunological con- 
sequences of inapparent infections. Here we show that a model 
incorporating high asymptomatic ratio and rapidly waning 
immunity, with infection both from human and environmental 
sources, explains 50 yr of mortality data from 26 districts of 
Bengal, the pathogen’s endemic home. We find that the asympto- 
matic ratio in cholera is far higher than had been previously sup- 
posed and that the immunity derived from mild infections wanes 
much more rapidly than earlier analyses have indicated. We find, 
too, that the environmental reservoir (free-living pathogen) is 
directly responsible for relatively few infections but that it may be 
critical to the disease’s endemicity. Our results demonstrate that 
inapparent infections can hold the key to interpreting the patterns 
of disease outbreaks. New statistical methods, which allow rigorous 
maximum likelihood inference based on dynamical models 
incorporating multiple sources and outcomes of infection, season- 
ality, process noise, hidden variables and measurement error, 
make it possible to test more precise hypotheses and obtain unex- 
pected results. Our experience suggests that the confrontation of 
time-series data with mechanistic models is likely to revise our 
understanding of the ecology of many infectious diseases. 

Cholera is a diarrhoeal disease caused by enteric infection with the 
bacterium Vibrio cholerae. Six of the seven cholera pandemics that 
have swept the globe since 1817 originated in the low-lying, densely 
populated regions north of the Bay of Bengal, where the disease is 
endemic. Although much attention has been focused on cholera, 
unsolved puzzles remain about its mode of transmission and the role 
of host immunity in its dynamics. This is largely because, in regions 
where cholera is endemic, most cholera cases are mild or asympto- 
matic but the true extent of asymptomatic infection has been difficult 
to assess. Estimates of the ratio of asymptomatic to symptomatic 
cases vary greatly, and the importance of inapparent infections in 
the dynamics of cholera outbreaks is unknown. To determine what 
role is played by inapparent infections, we used an approach that 
allows indirect inference about unobserved variables. 

A remarkably rich data set on the pattern of cholera epidemics 
exists in the form of mortality records kept by the sanitary commissioners 
of the former British East Indian province of Bengal. The 
data consist of monthly cholera death counts in each of 26 districts 
over the period 1891-1940 (Supplementary Fig. 1). To analyse these 
data, we formulated a series of models incorporating known 
or hypothesized mechanisms of transmission and immunity. A parsimonious model 
for cholera dynamics is of susceptible–infectious–recovered–susceptible (SIRS) 
form (Fig. 1a). A novel feature of this model is that it incorporates both 
transmission tied to human prevalence (using a traditional mass-action term) 
and transmission from an environmental reservoir (where the pathogen is 
commonly living in aquatic environments). This model is a partially observed, 
nonlinear, stochastic dynamical system in continuous time and as such is not 
amenable to analysis by standard statistical techniques. Inference for such 
systems has recently been facilitated by a new likelihood maximization 
procedure, which allows for measurement error, non-stationarity, irregular 
sampling intervals and the mechanistic inclusion of covariates (see Methods). 
We fitted the SIRS model to each district's data, assessed its explanatory 
power using log likelihood and the Akaike information criterion (AICc) and 
compared it with a semi-mechanistic time-series model for cholera and two 
non-mechanistic time-series models. 

Figure 1 | The mechanistic models used. a, SIRS model; b, two-path model; 
c, environmental-phage model. Births, related to the total population size H, 
are assumed to feed the pool of susceptibles, S. Individuals are susceptible to 
infection when born. Exposure to the pathogen occurs at time-dependent 
rate $\lambda(t)$. $c$ is the probability that an exposure leads to a contagious 
infection (class I). Note that when $c = 1$ and $\rho$ = , the two-path model (b) 
reduces to the SIRS model (a); when $c < 1$, some exposures result in short-term 
immunity (class Y). Infected individuals die at an excess rate $m$ and recover 
at a rate $\gamma$; the time an individual spends within the I class is exponentially 
distributed. We assume that an individual remains immune to reinfection 
for a duration gamma-distributed with mean $1/\varepsilon$ and variance $1/(k\varepsilon^2)$. Once 
immunity has waned, an individual re-enters the susceptible pool (S). The 
measured variable is monthly deaths, M. The mean duration of short-term 
immunity is $1/\rho$. Individuals in each class are subject to constant 
background mortality at rate 0.02 yr$^{-1}$. The force of infection, $\lambda(t)$, includes 
terms for environmental and human sources of infection and is assumed to 
vary seasonally. Because the seasonality of cholera dynamics in Bengal is 
complex, we used a semi-mechanistic approach: transmission was modelled 
by a flexible periodic function of time. In the environmental-phage model 
(c), as infected hosts shed pathogen, phage $\Phi$ builds up in the environment 
and reduces transmissibility. The equations specifying these models are 
given in the Supplementary Equations. 

The SIRS model explained the data dramatically better than the 
previous best fit (ΔAICc > 58; Supplementary Table 1). Parameter 
estimates reveal some surprises. A prediction of the model, robustly 
consistent across districts, is that immunity must wane on a timescale 
of weeks to months (Fig. 2 and Supplementary Table 3). This is in 
stark contrast to the widely held belief that infection-derived immunity 
to cholera wanes on a timescale of 3–10 yr (refs 13–17). The model 
also robustly predicts low case fatality (0.004 ± 0.002, mean ± s.d. 
across 26 districts). Because fatality among hospitalized cases was 
historically in excess of 50%, this implies a very large asymptomatic 
ratio. The implication is that most exposures result not in severe 
cholera, but in mild or asymptomatic infection, the immunological 
memory of which is short-lived. This prediction of a high asymptomatic 
ratio is consistent with the only intensive field studies of inapparent 
infections of which we are aware. However, because in this 
model all infected individuals are equally infectious, the prediction is 
that the vast majority of infectives are ‘silent shedders’—infectious 
but without symptoms. The evidence for such a conclusion is mixed 
but weak at best. Moreover, because in this model all infections are 
alike, it is unclear whether the improved dynamical description 
afforded by the SIRS model over the previous best fit is due to the 
higher proportion of asymptomatic infections or to the shorter dura- 
tion of immunity. 
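
The step from a low overall case fatality to a large asymptomatic ratio can be 
made explicit with a back-of-envelope calculation. The sketch below rests on 
two simplifying assumptions that are not stated in the text: that all cholera 
deaths occur among severe cases, and that a fraction $s$ of infections are 
severe with fatality $f_s$. The symbols $s$, $f_s$ and $f$ are introduced here 
for illustration only. 

```latex
% f   : overall case fatality across all infections (0.004 from the SIRS fit)
% f_s : fatality among severe (hospitalized) cases, taken at the quoted bound 0.5
% s   : fraction of infections that are severe (assumed symbol)
\begin{align*}
  f &= s\, f_s
  \quad\Longrightarrow\quad
  s = \frac{f}{f_s} \approx \frac{0.004}{0.5} = 0.008, \\
  \frac{1 - s}{s} &\approx \frac{0.992}{0.008} = 124 .
\end{align*}
```

Under these assumptions there would be roughly 124 mild or asymptomatic 
infections for every severe one, and more if severe-case fatality exceeded 50%. 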

To tease apart the immunological consequences of exposure to the 
pathogen from the degree of infectiousness induced, and to allow for 
heterogeneity in infectiousness, we formulated a second mechanistic 
model in which exposed individuals can follow an alternate pathway, 
deriving immunity to infection while shedding a negligible quantity 
of vibrio and suffering a negligible disease-induced mortality 
(Fig. 1b). In essence, this model allows for the possibility that expo- 
sure to the pathogen effectively vaccinates against reinfection. We 
leave it to the data to determine how frequently such natural vac- 
cination occurs and how long the resulting immunity lasts. 

Under the two-path model, the timescale of short-term immunity 
(9.9 ± 4.7 weeks, mean ± s.d. across 26 districts) agrees with that of 
the SIRS model. The estimated case fatality under this model is 
0.34 ± 0.21, consistent with conditions during this period, and 
survivors of severe infections receive longer-lasting immunity 
(1.5 ± 0.7 yr), consistent with experimental evidence. 

Figure 2 | Profile likelihood of the duration of immunity, $1/\varepsilon$, in the SIRS 
model for the data from Dacca district. Dacca (now spelled Dhaka) is the 
only district for which comparable analyses have been performed. To 
compute the profile likelihood, $\varepsilon$ is fixed and the likelihood is maximized 
over the remaining parameters. The vertical dashed lines embrace the 
approximate 95% confidence interval (2–12 weeks). 


Although in some districts the evidence strongly favours the two- 
path model, the data taken as a whole are equivocal on the subject of 
which model is better (see Supplementary Table 2). The implication 
is that it is the short-term immunity and high asymptomatic ratio 
predicted by both models, and not a large fraction of silent shedders, 
that accounts for the dramatically improved description afforded by 
our models. 

A second robust prediction of both models is that, in typical epi- 
demics, relatively few cases are due to the environmental reservoir. 
More precisely, though our models cannot speak to the exact route of 
transmission (that is, food-borne, water-borne, fomites, etc.), our 
results clearly indicate the relative importance of the positive feed- 
back associated with the human-associated source of infection over 
the human-independent environmental source of infection. 
Nevertheless, the seasonally averaged basic reproductive number, 
$R_0$, for human-associated transmission is estimated to be quite low 
(SIRS model, 1.5 ± 0.2; two-path model, 1.5 ± 0.2). This contrasts 
sharply with the much higher values of $R_0$ recently proposed and 
suggests, paradoxically, that for V. cholerae, in this, the region where 
cholera is most persistent and outbreaks most frequent, humans are 
potentially a marginal habitat: the persistence of the disease may be 
largely due to the environmental reservoir which provides a small 
extrinsic force of infection. The strength of this environmental force 
of infection varies geographically in a pattern (Supplementary Fig. 2) 
that mirrors the earlier suggestion that the coastal regions of 
Bangladesh are the native habitat of classical cholera. Both the 
SIRS and two-path models make the simplifying assumption that 
the force of infection due to the environmental reservoir has negligible 
seasonal fluctuation. To relax this assumption, we fit a third 
model which extends the SIRS model but allows for seasonality in the 
environmental reservoir. This model explains the data significantly 
better in many districts, but our conclusions about low $R_0$, rapid loss 
of immunity and high prevalence of inapparent infection remain 
unchanged (see Supplementary Information). 

A central challenge in cholera epidemiology is to explain how the 
outbreaks can be so explosive at the outset and yet self-limiting to the 
point that sizable epidemics can recur twice annually. Our explanation 
contrasts sharply with those proposed earlier. Simulations of 
the various models (Fig. 3) make plain the contrasts between the 
explanations. In the previous view, infection-derived immunity 
tends to wane over the scale of years, the explosiveness of outbreaks 
is controlled by spatial effects, and it is the seasonal drop in transmis- 
sibility (presumably related to environmental drivers) that stops the 
epidemics. Because in the previous view the ratio of deaths to infec- 
tions is comparatively modest, the susceptible population is depleted 
only slightly by each epidemic and is continually replenished by 
births and the waning of immunity. In the new view, epidemics are 
again triggered by rising seasonal transmissibility and the presence of 
a reservoir, but the high asymptomatic ratio means that many more 
individuals are exposed. The vast majority of these receive short-term 
protection from infection so that depletion of the susceptible pool 
brings the epidemic to a halt. As immunity wanes, on the timescale of 
weeks to months, the susceptible pool is replenished, setting the stage 
for the next outbreak. In the new view, severe cases play little role in 
shaping the dynamics. 

It has been proposed that the seasonality of cholera in this region 
may be due to the interaction of free-living vibrio with lytic bacteriophage 
in surface waters. According to this hypothesis, as an epidemic 
progresses, the density of phage in the environment increases 
and survivorship of vibrio consequently declines, which in turn leads 
to attenuation of transmissibility. To determine whether such an 
effect might explain the data, we formulated a fourth model, which 
extends the SIRS model by allowing for phage to build up in the 
environment (as infected individuals shed pathogen into surface 
waters) and ultimately interrupt transmission (Fig. 1c and 
Supplementary Equations). Using the same methods, we estimated 
the strength and timescale of this effect; the results are presented in 
the Supplementary Information. In brief, the main conclusions 
described above are robust to the inclusion of the environmental 
phage effect: the estimated duration of immunity remains short 
(9.3 ± 8.3 weeks), fatality low (0.004 ± 0.002, compare the corresponding 
value for the SIRS model) and $R_0$ small (1.6 ± 0.3). 
Overall, the historical mortality data are not strong evidence in 
favour of the environmental-phage hypothesis. Furthermore, a crit- 
ical implication of the environmental-phage hypothesis is that the 
phage cycles must lag behind cholera cycles, but not too far: the initial 
drop in cholera cases is predicted to be roughly coincident with high 
levels of ambient phage. In the one instance where concurrent data on 
cholera cases and ambient phage densities are available, however, the 
phage appear to lag approximately 180 degrees behind cholera 
cases. Thus cholera cases begin to fall while phage concentrations 
remain small, and cholera cases begin to climb again while phage 
densities remain high. This is further, independent, evidence against 
the notion that phage–vibrio interaction in the environment is 
responsible for cholera seasonality. 

The models presented here make several testable predictions. In 
particular, our results suggest that population surveys using sensitive 
techniques to detect the presence of V. cholerae, cholera-specific 
bacteriophage and/or coproantibodies in stools of healthy and mildly 
symptomatic individuals should reveal a much higher prevalence of 
inapparent infection than is currently understood to be the case. Such 
inapparent infections are predicted to be associated with reduced 
susceptibility to infection. Studies that centre on cholera patients, on the 
other hand, because they focus on the tail of the distribution of disease 
severity, can be expected to yield severely biased results when 
extrapolated to the population level. Further implications of our results for 
public health are discussed in the Supplementary Information. 

Figure 3 | Typical model simulations versus data for the district of Dacca. 
a, Simulated susceptible fraction (S/H) from our SIRS model with seasonal 
reservoir (blue) and from the time-series SIRS (TSIRS) model of Koelle et 
al. (red). The estimated susceptible fraction of the population is much 
lower under the TSIRS model, and grows with time. By contrast, under the 
models introduced here, rapid waning of immunity allows the susceptible 
fraction to return seasonally to high levels. The predicted high prevalence of 
inapparent infection implies that the susceptible pool is rapidly depleted 
during an outbreak. b, Simulated monthly deaths from the TSIRS model. 
Despite the low susceptible fraction and that model's non-mass-action 
transmission assumption, the relatively low prevalence of inapparent 
infection leads to overly explosive outbreaks and overly deep crashes. 
c, Simulated monthly deaths from the SIRS model with seasonal reservoir. 
d, Cholera death data from Dacca district. Parameter estimates for the 
seasonal-reservoir SIRS model are reported in the Supplementary 
Information; parameters for the TSIRS model are from K. Koelle (personal 
communication). The semi-parametric regression used to fit the TSIRS 
model is initialized using the first 11 yr of data: no simulated data are 
therefore generated over that interval. 

In all models of epidemiological dynamics, critical quantities 
include the fraction of individuals susceptible to infection, immune 
from infection and asymptomatically infected. Yet it is almost never 
the case that available data include direct measurements of these 
quantities. Moreover, it is frequently true that many asympto- 
matic—or merely unrecorded—infections exist for each case that is 
recorded. Our results demonstrate that conclusions about the bio- 
logical mechanisms underlying a pattern of disease outbreaks can 
depend sensitively on the prevalence of inapparent infections. New 
modelling approaches that allow indirect inference in the face of 
hidden variables and incomplete information are therefore likely to 
revise our understanding of the ecology of many infectious diseases. 


METHODS SUMMARY 


All models were formulated as stochastic differential equations, which were 
integrated using the Euler-Maruyama algorithm. Details of the equations and 
simulation methods are given in the Methods. The iterated filtering algorithm 
was used to fit the models to the data. This algorithm is described in detail in the 
Supplementary Methods and is implemented as part of an open-source package, 
pomp, within the R statistical computing environment. All code will be made 
available by the authors on request. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Inference for dynamical systems. A stochastic dynamical system is generated by 
a suitable function $f(x, s, t, \theta, W)$. Here, $W$ is a stochastic quantity that is drawn 
independently each time the function is evaluated and $\theta$ is a vector of parameters. 
Thus, for a sequence of times $t_0 \le t_1 \le \cdots \le t_N$, the recursion 

$$X(t_n) = f\big(X(t_{n-1}),\, t_{n-1},\, t_n,\, \theta,\, W\big),$$ 

together with the initial value $X(t_0)$, specifies a Markov process. The unspecified 
nature of $f(\cdot)$ reflects the idea that the evolution of the system could correspond 
to an essentially arbitrary stochastic simulation algorithm. In the context of this 
paper, $X(t)$ is a vector of counts in each disease class and $f(\cdot)$ corresponds to a 
(numerical) solution of any of the stochastic differential equation systems in the 
Supplementary Equations. The data consist of observations $y_1, \ldots, y_N$ at times 
$t_1, \ldots, t_N$. We suppose that, conditional on $\{X(t_k) = x_k\}_{k=1}^{N}$, $y_n$ is drawn 
from a density $g(y_n \mid x_n, t_n, \theta)$. 
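
The recursion and measurement density above can be sketched in a few lines. 
The following is a minimal illustration, not the authors' code: the function 
names `simulate_pomp`, `toy_step` and `toy_measure`, and the toy decay process 
they simulate, are hypothetical stand-ins chosen only to show the structure 
(a state update that draws fresh noise at every call, followed by a noisy 
observation of the new state). 

```python
import math
import random


def simulate_pomp(step, measure, x0, times, theta, rng):
    """Simulate X(t_n) = f(X(t_{n-1}), t_{n-1}, t_n, theta, W) and one
    observation y_n per time point t_1, ..., t_N."""
    states, observations = [], []
    x = x0
    for t_prev, t_next in zip(times[:-1], times[1:]):
        x = step(x, t_prev, t_next, theta, rng)               # process model f
        states.append(x)
        observations.append(measure(x, t_next, theta, rng))   # draw from g
    return states, observations


def toy_step(x, s, t, theta, rng):
    """Hypothetical process model: noisy exponential decay (not a cholera model)."""
    w = rng.gauss(0.0, 1.0)  # W is drawn independently at every evaluation
    dt = t - s
    drift = -theta["rate"] * x * dt
    noise = theta["sigma"] * x * math.sqrt(dt) * w
    return max(x + drift + noise, 0.0)


def toy_measure(x, t, theta, rng):
    """Hypothetical measurement model: Gaussian error proportional to the state."""
    return rng.gauss(x, theta["tau"] * x)
```

Any simulator with the same call signature could be substituted for 
`toy_step`, which is the sense in which $f(\cdot)$ is left unspecified. 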

In the Supplementary Methods, we present a pseudocode description of an 
iterated filtering methodology (maximization by iterated filtering, MIF) which 
has recently been shown to enable likelihood-based inference for this framework. 
This procedure allows inference for rather general partially observed, 
stochastic, multivariate, continuous-time dynamical systems. The MIF algorithm 
also accommodates measurement error, non-stationarity, irregular observation 
times and the mechanistic inclusion of covariates. It has been 
implemented in the R statistical language as part of the package pomp. Note 
that the algorithmic parameters play a role in the computational efficiency of the 
method but do not affect the scientific conclusions once likelihood maximization 
has been confirmed by diagnostic plots. The package and all other code 
will be made available by the first author on request. 
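
The MIF pseudocode itself is in the Supplementary Methods and is not 
reproduced here. As a hedged illustration of the building block that 
simulation-based likelihood methods of this kind rely on, the sketch below 
implements a plain bootstrap particle filter that estimates the log 
likelihood of a toy model. The model (a linear Gaussian autoregression), the 
function names and the parameter tuple `(a, sigma, tau)` are assumptions made 
for this example; this is not the MIF algorithm and not the pomp API. 

```python
import math
import random


def normal_logpdf(y, mean, sd):
    """Log density of a normal distribution."""
    return -0.5 * math.log(2.0 * math.pi * sd * sd) - 0.5 * ((y - mean) / sd) ** 2


def bootstrap_filter_loglik(ys, theta, n_particles, rng):
    """Estimate log L(theta) for the toy model
    X_n = a * X_{n-1} + sigma * W_n,   Y_n ~ Normal(X_n, tau)."""
    a, sigma, tau = theta
    particles = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    loglik = 0.0
    for y in ys:
        # 1. Propagate every particle through the process model.
        particles = [a * x + sigma * rng.gauss(0.0, 1.0) for x in particles]
        # 2. Weight each particle by the measurement density g(y | x).
        logw = [normal_logpdf(y, x, tau) for x in particles]
        top = max(logw)
        weights = [math.exp(lw - top) for lw in logw]  # rescaled for stability
        # 3. Accumulate the conditional log likelihood of this observation.
        loglik += top + math.log(sum(weights) / n_particles)
        # 4. Resample in proportion to the weights.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return loglik
```

Only simulation from the process model is required, never its transition 
density, which is why an approach of this type suits models defined by a 
simulator. 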

Fitting the Bengal data. Our statistical inference technique is based on 
simulation of the stochastic dynamical system, for which we used the 
Euler–Maruyama numerical scheme with a timestep of 0.05 months. We fit the 
parameters of each model to the data from each district 
using the algorithm given in the Supplementary Methods. We used J = 10,000 
particles, a fixed lag L = 60 and M = 80 iterations with a cooling factor, a = 0.95. 
Using the same algorithm, we then constructed profile likelihoods against the 
parameters $\varepsilon$ and $c$. This analysis revealed the existence of weak 
non-identifiabilities among the parameters $c$, $\gamma$, $m$, $\varepsilon$ and $\rho$. Despite this, 
certain combinations of parameters, such as 
$R_0 = \langle\beta_{\mathrm{seas}}(t)\rangle/(\gamma + \delta + m)$ and the case fatality 
$m/(\gamma + \delta + m)$, are well-identified. Moreover, although estimates of the 
durations of short-term immunity vary considerably among districts, it is the 
case that, for all districts, the duration of immunity is predicted to be far 
shorter than that derived from severe infections induced experimentally. 
Construction of the profile likelihoods also facilitated the location of the 
global maximum likelihood estimates, which, after further refinement using the 
algorithm above with 30,000 particles, are reported in the Supplementary Tables. 
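
To show what an Euler–Maruyama simulation with a 0.05-month timestep looks 
like, the sketch below integrates a deliberately simplified SIRS-type system. 
It is not the model of the Supplementary Equations: the equations, the noise 
term, the cosine seasonality and every value in `PARAMS` are illustrative 
assumptions, chosen so that the two well-identified combinations come out near 
the reported magnitudes ($R_0$ about 1.5, case fatality about 0.004). Here 
`delta` is assumed to be the background mortality rate and rates are per month. 

```python
import math
import random

# Illustrative values only (assumptions, not fitted estimates).
PARAMS = {
    "H": 1.0e6, "S0": 5.0e5, "I0": 100.0,
    "beta0": 6.0,          # mean human-associated transmission rate
    "amp": 0.3,            # relative amplitude of the seasonal cycle
    "omega": 1.0e-5,       # environmental-reservoir force of infection
    "sigma": 0.1,          # intensity of the process noise
    "gamma": 4.0,          # recovery rate
    "m": 0.016,            # excess (disease-induced) death rate
    "delta": 0.02 / 12.0,  # background mortality, 0.02 per year
    "eps": 0.45,           # rate of loss of immunity
}


def r0(mean_beta, gamma, delta, m):
    """Seasonally averaged transmission rate over the total exit rate from I."""
    return mean_beta / (gamma + delta + m)


def case_fatality(gamma, delta, m):
    """Probability that an infection ends in disease-induced death."""
    return m / (gamma + delta + m)


def simulate_sirs(params, months, dt=0.05, seed=1):
    """Euler-Maruyama integration of a toy stochastic SIRS system."""
    rng = random.Random(seed)
    p = params
    H = p["H"]
    S, I = p["S0"], p["I0"]
    R = H - S - I
    path = []
    for k in range(int(round(months / dt))):
        t = k * dt
        beta = p["beta0"] * (1.0 + p["amp"] * math.cos(2.0 * math.pi * t / 12.0))
        human = beta * I / H
        z = rng.gauss(0.0, 1.0)
        # Deterministic increment plus a Brownian increment scaled by sqrt(dt).
        infections = (human + p["omega"]) * S * dt \
            + p["sigma"] * human * S * math.sqrt(dt) * z
        infections = min(max(infections, 0.0), S)  # keep the flow feasible
        dS = p["delta"] * (H - S) * dt + p["eps"] * R * dt - infections
        dI = infections - (p["gamma"] + p["m"] + p["delta"]) * I * dt
        dR = p["gamma"] * I * dt - (p["eps"] + p["delta"]) * R * dt
        S, I, R = max(S + dS, 0.0), max(I + dI, 0.0), max(R + dR, 0.0)
        path.append((t + dt, S, I, R))
    return path
```

Calling `simulate_sirs(PARAMS, months=120)` returns one `(time, S, I, R)` 
tuple per timestep; changing `eps` is a quick way to see how the duration of 
immunity controls how fast the susceptible pool refills. 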

Review of likelihood-based inference. We give a review of likelihood-based 
inference in the context of this paper. This review focuses on practical 
considerations; we refer the reader to Casella & Berger or Rice for a complete 
introduction, and to Barndorff-Nielsen & Cox for a more advanced treatment. For a 
discussion of the role of likelihood, profile likelihood ratio tests applied to 
stochastic models for biological systems, we direct the reader to Hilborn & 
Mangel. Reasons to use likelihood as a basis for inference include statistical 
efficiency (that is, making the strongest conclusions available from limited data), 
objectivity (that is, providing a single criterion applicable to a wide range of 
models and data) and the availability of useful techniques for comparing rival 
hypotheses. 

For time-series data, written as $x_{1:T} = x_1, \ldots, x_T$, a stochastic model consists of 
a density function $f(x_{1:T} \mid \theta)$ depending on a vector $\theta$ of unknown parameters. The 
likelihood function arises from considering the density as a function of $\theta$, 
treating the data as fixed. Thus the log likelihood is defined to be 

$$\log \mathcal{L}(\theta) = \log f(x_{1:T} \mid \theta),$$ 

where log is the natural logarithm. Rival hypotheses, labelled $H_0$ and $H_1$, 
concerning the appropriate structure for a stochastic model can be formalized in 
terms of whether $\theta$ takes a value in $\Theta_0$ or $\Theta_1$, respectively, where $\Theta_0$ and $\Theta_1$ are 
subsets of the set $\Theta$ of all possible parameter vectors. We refer to $H_0$ and $H_1$ as 
hypotheses or models interchangeably. 

Comparison of nested models. Suppose that $H_0$ and $H_1$ are nested, meaning that 
$\Theta_0$ is a subset of $\Theta_1$. In practical terms, this is taken to mean that $\Theta_0$ is described 
by $d_0$ freely varying parameters, whereas $\Theta_1$ is described by these $d_0$ parameters 
together with a number $d_1 - d_0$ of additional parameters. Thus $\Theta_0$ can be written 
in terms of a vector of length $d_0$ and $\Theta_1$ in terms of a vector of length $d_1$. The 
maximum likelihood parameter estimate for model $H_0$ is denoted by $\hat\theta_0$ and 
corresponds to the parameter value giving rise to the largest value of $\log \mathcal{L}(\theta)$ 
among all values of $\theta$ in $\Theta_0$. A corresponding definition applies to $\hat\theta_1$. Because $\Theta_1$ 
includes $\Theta_0$, it follows that $\log \mathcal{L}(\hat\theta_1) \ge \log \mathcal{L}(\hat\theta_0)$. Under standard regularity 
conditions, twice the improvement in the log likelihood that occurs when moving 
from the simpler model $H_0$ to the more complex model $H_1$ can be compared 
with a $\chi^2$ random variable on $d_1 - d_0$ degrees of freedom. Thus, Supplementary 
Table 1 presents a P-value of the form 

$$P = \mathrm{Prob}\left[\chi^2_{d_1 - d_0} > 2\left\{\log \mathcal{L}(\hat\theta_1) - \log \mathcal{L}(\hat\theta_0)\right\}\right]. \qquad (1)$$ 

The P-value given by equation (1) is an approximation. More precise P-values 
can in principle be obtained by simulation, but this $\chi^2$ approximation is a widely 
used technique that has been found to be reliable for many practical purposes. 
It is both conceptually and operationally simpler to calculate initially the 
standard $\chi^2$ approximation in equation (1); if this gives rise to an unequivocal answer, 
there is little reason to present a simulation study as additional evidence. 
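
Equation (1) is straightforward to evaluate numerically. The sketch below is 
a minimal standard-library illustration; the function names `chi2_sf` and 
`lrt_p_value` are hypothetical, and the chi-squared tail probability uses the 
closed-form finite series that holds for integer degrees of freedom. 

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability Prob[chi2_df > x] for integer df >= 1."""
    if x <= 0.0:
        return 1.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{r=0}^{df/2 - 1} (x/2)^r / r!
        term, total = 1.0, 1.0
        for r in range(1, df // 2):
            term *= (x / 2.0) / r
            total += term
        return math.exp(-x / 2.0) * total
    # Odd df: erfc(sqrt(x/2)) + sqrt(2/pi) * exp(-x/2) * sum_r chi^(2r-1) / (2r-1)!!
    chi = math.sqrt(x)
    term, total = chi, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= x / (2 * r - 1)
        total += term
    tail = math.erfc(chi / math.sqrt(2.0))
    return tail + math.sqrt(2.0 / math.pi) * math.exp(-x / 2.0) * total


def lrt_p_value(loglik_complex, loglik_simple, extra_params):
    """P-value of equation (1): twice the log-likelihood gain referred to a
    chi-squared distribution on d1 - d0 = extra_params degrees of freedom."""
    statistic = 2.0 * (loglik_complex - loglik_simple)
    return chi2_sf(statistic, extra_params)
```

For example, `lrt_p_value(-1230.0, -1236.5, 3)` refers a statistic of 13 to 
three degrees of freedom; the two log likelihoods in that call are made-up 
numbers, not values from Supplementary Table 1. 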

Modifications to equation (1) are required when the models under consideration 
are nested in a non-standard way. This arises in the context of the 
analysis presented in Supplementary Table 1, owing to the non-standard nesting 
of the SIRS model within the two-path model. Specifically, the two-path model 
reduces to the SIRS model when $c = 1$, at which point $\rho$ is unidentified. 
Anisimova et al. investigated similar situations and demonstrated that the 
chi-squared approximation to the likelihood ratio test is conservative in such 
cases. In the caption of Supplementary Table 1, we report the P-value associated 
with such a test on three degrees of freedom: $\rho$, $c$ and the initial condition $Y_0$. 

Non-nested models may still be compared by their likelihood, for example by 
the use of Akaike's information criterion, but formal testing of hypotheses 
becomes more awkward. Thus, in Supplementary Table 1, we contrast the 
best-fitting SIRS model with the nested SIRS model constrained to match the 
duration of immunity found by Koelle & Pascual. Another consideration 
motivating this comparison is that the nested SIRS model has a similar likelihood to the 
model of Koelle & Pascual, while having fewer parameters to estimate. 
Additionally, this approach avoids the issue that the comparison with previously 
published models might be unfair because the procedure used to fit the models in 
this paper may be more effective than that used by previous research. 
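
A likelihood-based comparison of non-nested models can be sketched as 
follows. The code uses the textbook definitions of Akaike's information 
criterion and its small-sample correction; the function names are 
hypothetical, and using the number of observations as the sample size `n` is 
an assumption of this example rather than something stated in the text. 

```python
def aic(loglik, n_params):
    """Akaike's information criterion: 2k - 2 log L."""
    return 2.0 * n_params - 2.0 * loglik


def aicc(loglik, n_params, n_obs):
    """AIC with the usual small-sample correction 2k(k+1)/(n-k-1)."""
    correction = 2.0 * n_params * (n_params + 1) / (n_obs - n_params - 1)
    return aic(loglik, n_params) + correction


def delta_aicc(candidate, reference, n_obs):
    """AICc of the reference minus AICc of the candidate model.

    Each model is a (loglik, n_params) pair; a positive result favours
    the candidate."""
    return aicc(reference[0], reference[1], n_obs) - aicc(candidate[0], candidate[1], n_obs)
```

With 50 years of monthly counts a district contributes 600 observations, so a 
call such as `delta_aicc((-1200.0, 15), (-1240.0, 12), 600)` compares two 
hypothetical fits on that scale. 
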

Profile likelihood. The profile likelihood is a description of the weight of 
evidence about the value of a single component of the vector of unknown parameters. 
Setting $\theta = (\theta_1, \ldots, \theta_d)$, the log likelihood can be written as $\log \mathcal{L}(\theta_1, \ldots, \theta_d)$ and 
the profile log likelihood of $\theta_1$ is 

$$\log \mathcal{L}_{\mathrm{profile}}(\theta_1) = \max_{\theta_2, \ldots, \theta_d} \log \mathcal{L}(\theta_1, \ldots, \theta_d).$$ 

From equation (1), a 95% confidence region for $\theta_1$ is given by the range of 
values of $\theta_1$ for which 

$$2\left(\log \mathcal{L}(\hat\theta) - \log \mathcal{L}_{\mathrm{profile}}(\theta_1)\right) < c,$$ 

where $c$ is defined by $\mathrm{Prob}[\chi^2_1 < c] = 0.95$. This is how the confidence interval in 
Fig. 2 was constructed. 
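
The construction can be illustrated on a problem small enough to solve in 
closed form. In the sketch below the data are assumed to be normally 
distributed with unknown mean and variance, the mean plays the role of 
$\theta_1$ and the variance is the nuisance parameter maximized out. This toy 
model and the function names are assumptions of the example; in the paper the 
inner maximization is carried out numerically by iterated filtering. 

```python
import math

CHI2_1_95 = 3.841  # c such that Prob[chi2_1 < c] = 0.95


def profile_loglik_mean(data, mu):
    """Profile log likelihood of the mean of a normal sample.

    For fixed mu the maximizing variance is the mean squared deviation,
    so the inner maximization has a closed form."""
    n = len(data)
    var_hat = sum((x - mu) ** 2 for x in data) / n
    return -0.5 * n * (math.log(2.0 * math.pi * var_hat) + 1.0)


def profile_interval(data, grid):
    """Approximate 95% interval: grid values of mu whose profile log
    likelihood lies within c/2 of the maximum found on the grid."""
    profile = [(mu, profile_loglik_mean(data, mu)) for mu in grid]
    best = max(ll for _, ll in profile)
    inside = [mu for mu, ll in profile if 2.0 * (best - ll) < CHI2_1_95]
    return min(inside), max(inside)
```

The maximum over the grid stands in for the global maximum of the log 
likelihood, so the grid must be fine enough and wide enough to contain it. 
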
Internal brain state regulates membrane potential 
synchrony in barrel cortex of behaving mice 


James F. A. Poulet & Carl C. H. Petersen 


Internal brain states form key determinants for sensory perception, 
sensorimotor coordination and learning. A prominent 
reflection of different brain states in the mammalian central nervous 
system is the presence of distinct patterns of cortical synchrony, 
as revealed by extracellular recordings of the 
electroencephalogram, local field potential and action potentials. 
Such temporal correlations of cortical activity are thought to be 
fundamental mechanisms of neuronal computation. However, 
it is unknown how cortical synchrony is reflected in the intracellular 
membrane potential ($V_m$) dynamics of behaving animals. 
Here we show, using dual whole-cell recordings from layer 2/3 
primary somatosensory barrel cortex in behaving mice, that the 
$V_m$ of nearby neurons is highly correlated during quiet wakefulness. 
However, when the mouse is whisking, an internally generated 
state change reduces the $V_m$ correlation, resulting in a 
desynchronized local field potential and electroencephalogram. 
Action potential activity was sparse during both quiet wakefulness 
and active whisking. Single action potentials were driven by a 
large, brief and specific excitatory input that was not present in 
the $V_m$ of neighbouring cells. Action potential initiation occurs 
with a higher signal-to-noise ratio during active whisking than 
during quiet periods. Therefore, we show that an internal brain 
state dynamically regulates cortical membrane potential synchrony 
during behaviour and defines different modes of cortical 
processing. 

To explore their environment actively, mice rapidly move their 
whiskers back and forth at about 8–12 Hz in a stereotypical behaviour 
termed whisking. Cortical sensory processing differs markedly 
between such active whisking periods and quiet wakefulness, when 
the whiskers are not moving. The flow of sensory information 
from receptors in the whisker follicle, via the infraorbital nerve 
(ION), brainstem and thalamus to the primary somatosensory barrel 
cortex (Fig. 1a), is therefore strongly regulated by behavioural state. 
Here we examine the membrane potential dynamics of layer 2/3 
neurons in the barrel cortex of behaving mice and use high-speed 
(500 Hz) filming of whisker movement as a measure of the beha- 
vioural state (Fig. 1b). 

Neurons in layer 2/3 are of particular interest because their activity 
can drive behaviour and because they are readily accessible for 
optical imaging in behaving mice. Our recordings revealed prominent 
changes in cortical dynamics, termed a change in ‘brain state’, 
as the quiet, awake mouse begins to whisk (Fig. 1c–h). Slow (1–5 Hz), 
large-amplitude membrane potential ($V_m$) fluctuations characterize 
the quiet brain state and are prominent during quiet wakefulness. 
These slow oscillations are significantly reduced in amplitude during 
whisking, defining an active brain state (Fig. 1c). 

The primary sensory neurons of the IONs, which signal whisker 
movement to the brain, are silent during quiet wakefulness but fire 
action potentials (APs) during whisking. Therefore, the change in 
brain state could result from sensory feedback generated during 
whisker movements, or, alternatively, it could be centrally generated 
within the central nervous system. To distinguish between these 
two sources, we made bilateral sections of the IONs (Fig. 1a, d). 
Whisking was not impaired by cutting the IONs, because the facial 
nerves, containing the motor neuron axons, were left intact. During 
whisking, mice with intact (n = 10) and cut (n = 10) sensory nerves 
showed a strong and significant decrease in the power of low-frequency 
(1–5 Hz) membrane potential oscillations (Fig. 1e) and in the 
membrane potential variance (Fig. 1f). In both groups of mice, layer 
2/3 cortical neurons depolarized slightly but significantly during 
whisking (Fig. 1g) without overall changes in firing rates (Fig. 1h). 
None of these parameters was significantly different when comparing 
preparations in which the nerve was intact to those in which the nerve 
was cut (Fig. 1e–h). Therefore, the most prominent changes in brain 
state comparing quiet wakefulness to whisking periods do not 
depend on sensory input from the whiskers. However, detailed exam- 
ination of the membrane potential dynamics with respect to the 
millisecond-by-millisecond trajectory of whisker position revealed 
differences between mice with intact and cut IONs (Supplementary 
Fig. 1). Averaging of consecutive whisking cycles aligned to the peak 
of protraction showed that the membrane potential of layer 2/3 
pyramidal neurons in mice with intact sensory nerves exhibited a 
fluctuation highly correlated with whisker position’ (Fig. 1i, j). In 
mice with cut IONs, this fast whisker-position-dependent membrane 
potential fluctuation was not observed (Fig. 1i, j), in agreement with a
previous extracellular study’. These data provide functional evid- 
ence, alongside the post-mortem anatomical verification, that the 
IONs were indeed cut. Despite cutting the IONs, the characteristic 
change in brain state persisted during whisking. We therefore con- 
clude that it must be generated internally, within the central nervous 
system, and defines an active cortical processing mode. 

The key differences between the quiet and the active brain states 
were the changes in the slow subthreshold membrane potential 
dynamics. During anaesthesia, slow membrane potential oscillations 
are highly synchronized”. We therefore next explored the relation- 
ship in behaving mice between single neuron membrane potential 
dynamics and local network activity measured simultaneously 
through electroencephalogram (EEG) and the local field potential 
(LFP; Fig. 2a and Supplementary Figs 2 and 3). The slow large-amplitude
fluctuations in Vm during quiet wakefulness correlated closely
with both LFP and EEG measurements (Fig. 2b-d). However, EEG,
LFP and Vm fluctuations decorrelated during active whisking periods
(Fig. 2b-d). These state-dependent changes in oscillatory electrical 
brain activity may be analogous to those first observed in recordings 
of the human EEG, where subjects relaxing with their eyes closed had 
slow large-amplitude EEG waves, which disappeared on eye open- 
ing”. Active processing modes in the neocortex, both in mouse and 
human, therefore seem to be associated with desynchronized electrical 
activity, whereas large-amplitude slow synchronous oscillations are
prominent during rest (7-12 Hz for human and 1-5 Hz for mouse). 

Figure 1 | Internal brain state determines membrane potential dynamics.
a, Schematic of whisker sensory pathway; scissors indicate position of the
bilateral ION cut. b, Single image from high-speed (500 Hz) whisker filming
showing measurement of whisker angle. c-h, Membrane potential (Vm)
dynamics undergo a change in state during whisking (red) as compared to
quiet periods (black), irrespective of whether the IONs were intact (black) or
cut (grey). Shown are individual traces from example experiments (c, d) and
quantification across recordings (e-h; mean ± s.d.; n = 10 intact; n = 10
cut). i, Example experiments show an average Vm modulation phase-locked
to whisker movements in a mouse with intact IONs, but not in a mouse with
cut IONs. j, Averaged across recordings, mice with intact IONs have larger
Vm modulations phase-locked to whisker movement than mice with cut
IONs. Mice with intact IONs are different from shuffled data (blue), but
mice with cut IONs are not different from shuffled data (mean ± s.d.; n = 10
intact; n = 10 cut).

The simplest explanation for the close correlation between LFP
and Vm during quiet wakefulness would be that nearby neurons have
highly synchronous slow membrane potential fluctuations. To test
this directly, we made simultaneous dual whole-cell recordings from
nearby barrel cortex neurons in layer 2/3 (Fig. 3). In an example dual
recording of identified layer 2/3 pyramidal neurons (Fig. 3a), membrane
potential changes were remarkably similar in the two cells
during quiet periods. This is obvious through visual inspection of
the overlaid membrane potential traces (Fig. 3b) and was quantified 
across five different periods in this example experiment by the high 
cross correlation value of 0.79 (Fig. 3c). Whisking induced a robust 
and significant decrease in membrane potential correlation in this 
example experiment to a value of 0.43 (quantified across five periods 
adjacent to those analysed during quiet wakefulness). A similar pat- 
tern of dynamically regulated membrane potential synchrony is 
shown in a second example experiment (Fig. 3d-f and
Supplementary Movie 1; a further example experiment is shown in 
Supplementary Fig. 4). High correlation values of membrane potential
dynamics during quiet periods (mean zero-time-difference cross
correlation value ± s.d. 0.72 ± 0.11, n = 19 mice) and a reduced
value during active whisking periods (mean zero-time-difference
cross correlation value 0.33 ± 0.17, n = 19 mice) were observed in
every pair of recorded neurons (n = 19 dual recordings, with a mean
of 3.8 whisking episodes analysed per dual recording). The decorrelation
induced by the active brain state during whisking was highly
significant (P < 0.001; Fig. 3g-i); however, the Vm correlation
remains significantly above chance level (cross correlation value of
shuffled traces 0.05 ± 0.11, n = 19, P < 0.001). The remaining correlated
membrane potential dynamics during whisking might in part
be driven by sensory input phase-locked to the whisker cycle (Fig. 1i, j
and Supplementary Fig. 5). Fast Fourier transform (FFT) analysis 
reveals that the most notable changes in the power of membrane 
potential fluctuations between quiet and active periods occur in 
the low frequency range (Fig. 3j). Furthermore, spectral coherence 
analysis indicates that low-frequency membrane potential oscilla- 
tions also underlie the major state-dependent changes in the ampli- 
tude of correlation (Fig. 3k). 

These measures of correlated membrane potential fluctuations are 
entirely dominated by subthreshold activity because APs in layer 2/3 
neurons of mouse barrel cortex occur at low rates (mean ± s.d.,
0.61 ± 0.99 Hz; median of 0.13 Hz; n = 48) both during quiet wakefulness
and active whisking periods”* (Fig. 1h and Supplementary Fig.
6). In our dual whole-cell recordings, we therefore analysed the mem- 
brane potential dynamics immediately surrounding individual APs 
(separated from other APs by at least 100 ms to avoid bursts of APs; 
Fig. 4 and Supplementary Fig. 6). Whereas membrane potential fluc- 
tuations in nearby neurons are highly significantly correlated in both 
quiet wakefulness and whisking, little or no synchronous AP activity 
was observed in nearby neurons (Supplementary Fig. 7). The inde- 
pendence of AP firing in nearby neurons provides specificity for 
information processing. Both during quiet (Fig. 4a, b) and active 
whisking (Fig. 4c, d) periods, APs were evoked at the same threshold 
(quiet −39.5 ± 3.3 mV, whisk −39.0 ± 3.3 mV, n = 9, P = 0.20) by a
large, brief pre-spike depolarization. Synaptic integration in mouse 
layer 2/3 pyramidal neurons probably occurs over ~20 ms timescale, 
because this is close to the typical excitatory postsynaptic potential 
half-width and the membrane time-constant. In the 20 ms preceding 
AP initiation, the Vm of the spiking cell depolarized by 9.0 ± 2.3 mV
(quiet 9.7 ± 2.6 mV, whisk 8.3 ± 2.0 mV, n = 7, P = 0.31; Fig. 4e).
However, the Vm of the nearby simultaneously recorded neuron
depolarized only by 1.4 ± 1.2 mV (quiet 1.7 ± 1.5 mV, whisk
1.0 ± 0.8 mV, n = 7, P = 0.28; Fig. 4e). APs are therefore driven by
specific synaptic input to a subpopulation of neurons. Although 
sensory input during whisking (Fig. 1i, j) might contribute to driving 
these APs, we found a similar value for pre-spike depolarization in 
mice with cut sensory nerves (quiet 9.5 ± 1.7 mV, whisk
8.2 ± 1.8 mV, n = 4). The pre-spike depolarization is large and brief,
allowing APs to occur with high temporal precision, irrespective of 
whether a spike occurs during quiet wakefulness, active whisking or 
active whisking with cut sensory nerves. It is likely that APs are driven 
by large pre-spike depolarization, in part, because threshold is lower 
when approached from a rapidly depolarizing trajectory’® 
(Supplementary Fig. 8). 

Figure 2 | EEG, LFP and Vm are highly correlated

We defined a signal-to-noise ratio for AP initiation in layer 2/3
cortical neurons as the ratio of the pre-spike depolarization to the 
mean variance of the membrane potential fluctuations. This measure 
shows that APs are driven with significantly higher signal-to-noise 
ratios during whisking (0.54 ± 0.2; n = 7) compared to quiet wakefulness
(0.24 ± 0.11; n = 7; P = 0.008; Fig. 4f) when the pre-spike
depolarization is measured 20 ms before threshold. This signal-to- 
noise ratio measure remains significantly higher during whisking 
compared to quiet wakefulness when pre-spike depolarizations are 
measured 5 ms, 10 ms and 50 ms before threshold (P < 0.01 in each
case). 


A large, brief and specific input therefore drives APs in layer 2/3 
barrel cortex neurons of awake mice. Such membrane potential 
dynamics appear well-suited for the reliable coding of information 
in networks with sparse AP activity'*'®’” °°. Future experiments must 
determine whether similar principles govern APs in other cortical 
layers and other cell types. 

Correlated neuronal activity was first observed in EEG recordings 
of the human brain, revealing slow, large-amplitude oscillations dur- 
ing periods of quiet wakefulness”. Since then, extracellular electro- 
physiological studies have found evidence for synchronous activity at 
frequencies ranging from 0.05 Hz to 500 Hz, correlating closely with 
changes in behavioural state, attention, sensory processing and
motor planning'"!. Investigation of the underlying subthreshold 
events generating these patterns of cortical activity requires simultan- 
eous dual intracellular recordings of membrane potential. Previous 
studies have shown that nearby cortical neurons exhibit highly syn- 
chronous membrane potential fluctuations under anaesthesia**™; 
however, dual intracellular recordings have never previously been 
made in behaving mammals. Here we have shown that a change in 
brain state dynamically and profoundly regulates cortical membrane 
potential synchrony during behaviour. The change in brain state is 
regulated by an internally generated signal, because it persists after 
cutting the sensory nerves. In future studies, it will be of great interest 
to define the underlying central mechanisms generating the brain 
state changes in somatosensory cortex during whisking, and to deter- 
mine whether they relate to direct effects of neuromodulators in the 
neocortex or to changes in corticocortical or thalamocortical input. 

Our data show that neighbouring layer 2/3 neurons become more 
independent during the active state of the mouse barrel cortex. Active 
cortical brain states may therefore serve to augment the total cortical 
information processing capacity through decorrelation of membrane
potential synchrony while increasing signal-to-noise ratios for AP
initiation.

Figure 4 | Action potentials result from large, brief and specific input
during both quiet wakefulness and whisking. a, c, Simultaneous membrane
potential recording from two layer 2/3 neurons during quiet wakefulness
(a) and whisking (c), focusing on single APs in one of the neurons, which we
arbitrarily label cell 1 (black traces). The Vm in the nearby neuron, which we
label cell 2 (blue traces), remains subthreshold, indicating a lack of AP
synchrony. A single example (left) is accompanied by the spike-triggered
average for this cell pair (right). b, d, The spike-triggered grand average Vm
across dual recordings during quiet wakefulness (b) and whisking
(d) (n = 7). e, The pre-spike depolarization (PSD) quantifies the change in
Vm of both cells during the 20 ms preceding AP initiation in cell 1
(mean ± s.d.; n = 7). f, The ratio of the depolarization driving APs (PSD)
relative to the variance of ongoing Vm fluctuations (or signal-to-noise ratio
for evoking a spike) is higher during whisking than during quiet wakefulness
(mean ± s.d.; n = 7).


METHODS SUMMARY 


Male 4-6-week-old C57BL6J mice were implanted with a light-weight metal
head holder and a recording chamber. They were then habituated to head 
restraint in the recording setup over a period of several days. All whiskers, except 
the C2 whiskers, were trimmed and the location of the left C2 barrel column was 
functionally located through intrinsic optical imaging, which was used to target 
the recordings through a small craniotomy”. Bilateral sectioning of the
infraorbital nerves was completed 2-4 h before neuronal recording and was
confirmed post-mortem. Whole-cell pipettes had resistances of 5-7 MΩ and were
filled with a solution containing (in mM): 135 potassium gluconate, 4 KCl, 10
HEPES, 10 phosphocreatine, 4 MgATP and 0.3 Na3GTP (adjusted to pH 7.3 with
KOH), as well as 2 mg ml⁻¹ biocytin to allow anatomical reconstruction of cell
morphology and location. Only neurons recorded in layer 2/3 were included in 
this study. LFP recordings in the immediate vicinity of the whole-cell recordings 
were made with 5-7 MΩ glass electrodes filled with Ringer’s solution. EEG
recordings were obtained using a metal wire recording electrode on the surface 
of the barrel cortex and a reference electrode on the cerebellum. We used a high- 
speed (500 Hz) camera to film whisker movements under infrared illumination 
and custom-written software to measure whisker angle. Data are presented as 
mean ± standard deviation. Statistical significance was assessed using Student’s
t-tests (paired or unpaired assuming unequal variances). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS

Animal preparation and surgery. All experiments were carried out in accord- 
ance with the Swiss Federal Veterinary Office, following previously described 
methods'’. Male 4-6-week-old, C57BL6J mice were implanted with a light- 
weight metal head holder and a recording chamber under 1.5% isoflurane. All 
whiskers except C2 were trimmed, and the location of the left C2 barrel column 
was functionally located through intrinsic optical imaging under 1% isoflurane. 
The cortical surface was visualized through the intact bone. The C2 whisker was 
deflected at 10 Hz and the evoked haemodynamic signal was imaged by a Qicam 
CCD camera (Q-imaging) under 630 nm illumination provided by light-emit- 
ting diodes. The images were processed online by custom-written routines run- 
ning in IgorPro (Wavemetrics). A small craniotomy was made over the 
functionally identified location of the C2 barrel column to target whole-cell 
and local field potential recordings. 

Bilateral sectioning of the IONs was completed 2-4 h before neuronal
recording. A small incision was made in the skin exposing the IONs where they emerge
from the infraorbital fissure’'. Sectioning of the IONs was made through the 
incision starting with the dorsal branches and finishing ventrally. After section- 
ing the IONs, deflection of the C2 whisker never evoked a detectable intrinsic 
optical signal. After every recording session, we confirmed complete sectioning 
of the entire ION on both the left and the right side in a post-mortem examina- 
tion. The large size of the IONs allows a straightforward verification of the nerve 
cut. Whisking behaviour was not overtly different between mice with intact 
nerves and those with cut nerves, as detailed below. 
Habituation to head restraint. Mice readily habituate to head restraint. The first 
head-restrained sessions of each mouse lasted only for a few minutes and this 
time was gradually increased each day until the mouse would sit calmly for a 
period of roughly 1 h. After several days of habituation to head restraint, record- 
ings were performed in a single session lasting up to 2 h, after which the mouse 
was killed to obtain anatomical information. 

Mice never slept while in head fixation during the training period or during
the recording session, and they remained behaviourally responsive to sensory
stimuli at all times throughout the recording session. In a previous study, we 
demonstrated that the EEG patterns observed during active whisking periods, 
quiet wakefulness and sleep are quantitatively different (Supplementary Fig. 1 in 
ref. 13). 
Neuronal recordings. Whole-cell recordings from awake head-restrained mice 
were obtained as described previously'’. Pipettes had resistances of 5-7 MΩ and
were filled with a solution containing (in mM): 135 potassium gluconate, 4 KCl,
10 HEPES, 10 phosphocreatine, 4 MgATP and 0.3 Na3GTP (adjusted to pH 7.3
with KOH), as well as 2 mg ml⁻¹ biocytin. Only neurons recorded in layer 2/3 (at
depths <500 µm below the pia) were included in this study. Whole-cell
electrophysiological measurements were made with a Multiclamp 700A amplifier
(Axon Instruments). The Vm was filtered at 10 kHz and digitized at 20 kHz by
ITC-18 (Instrutech Corporation) under the control of IgorPro. No current was
injected during recording, and Vm was not corrected for liquid junction
potentials.

LFP recordings were made with 5-7 MΩ glass electrodes filled with Ringer’s
solution. The LFP electrodes were carefully lowered into layer 2/3 into the same
craniotomy and at the same time as the patch electrodes. The measured depths
of the LFP and patch electrodes differed by <50 µm. The LFP signal was
amplified by a Multiclamp 700A amplifier, filtered at 10 kHz and digitized at 
20 kHz by ITC-18 under the control of IgorPro. LFP recordings were bandpass 
filtered off-line in IgorPro between 0.1 Hz and 100 Hz. 

EEG recordings using exposed tips of 50-µm Teflon-coated tungsten wires
were obtained from the surface of the barrel cortex (~500 µm from the
craniotomy) with the reference electrode on the cerebellum. The EEG signal was
amplified by a CyberAmp 380 amplifier (Axon Instruments), filtered at 0.1 Hz-10 kHz
and digitized at 20 kHz by ITC-18 under the control of IgorPro. EEG recordings 
were filtered off-line in IgorPro between 0.1 Hz and 100 Hz. 
Recording behaviour. We used a high-speed (500Hz) MotionPro camera 
(Redlake) to film whisker movements under infrared illumination. The field 
of view was 25 × 25 mm, with a single pixel resolution of 0.1 mm.
Custom-written image analysis software running in ImageJ was used to measure whisker
angle. 

Whisking is driven by motoneurons of the facial nucleus, which are controlled 
by several mechanisms, including serotonergic’? and cortical control'>’’. 
Cortical input is not essential to initiate whisking, because it is clear that whisk- 
ing can occur in the absence of cortical activity (for example, after lesions to 
motor cortex)****. We do not know which control mechanisms give rise to the 
whisking observed in our study, but we found that the decorrelation of cortical 
activity occurred robustly in every whisking episode. Therefore, although there 
might be differences in brain state during the head-fixation session, this did not 
impact on our observations. Most bouts of whisking included in this study were
initiated in the absence of any overt external event and we therefore consider 
them to be spontaneous. In addition, we found that auditory, olfactory and 
visual stimuli could induce whisking. The change in cortical dynamics and 
synchrony from quiet wakefulness to whisking was robust and did not depend 
on how whisking was initiated. 

Whisking bouts lasted for a similar duration with IONs intact (4.3 ± 1.1 s,
n = 10 mice) or cut (4.5 ± 1.1 s, n = 10 mice). FFT analysis of whisker
movement showed a peak at 8.5 Hz both for mice with intact (n = 10 mice) and
for those with cut (n = 10 mice) IONs.

Neuron staining. Each mouse was deeply anaesthetized with urethane and per- 
fused with 4% paraformaldehyde immediately after the experiment. 
Subsequently, tangential slices were processed for cytochrome oxidase staining 
to reveal the barrel map and for biocytin staining (Vectastain) to reveal the 
morphology of the recorded neuron. Neuronal reconstruction was performed 
in three dimensions with Neurolucida software (MicroBrightField). 

Data analysis. Three independent data sets were gathered for Figs 1, 2 and 3. The 
data shown in Fig. 1 were recorded from 10 neurons in mice with intact IONs and 
10 neurons in mice with cut IONs. Each cell was recorded in a different mouse. 
The data in Fig. 2 were recorded from a further 11 cells in 9 additional mice. The 
data in Fig. 3 were recorded from 19 dual whole-cell recordings with each pair of 
neurons in a different additional mouse. 

For Fig. 1e-g, properties of membrane potential dynamics were calculated
from 3-s segments of data taken from a visually identified central period of each 
whisking episode during the recordings and compared to adjacent quiet sections 
of the recording without whisking. Data segments in the nerve-intact configura- 
tion were taken from 41 whisking episodes (range 2-6, mean 4.1 per experiment) 
from 10 experiments; data segments for analysis in the nerve-cut configuration 
were taken from 56 whisking episodes (range 3-8, mean 5.6 per experiment) 
from 10 experiments. The power of low frequency Vm changes was computed by
measuring the area under the FFT between 1 Hz and 5 Hz. For Fig. 1h, spike rates 
were calculated across all quiet and whisking periods in the recorded sweeps. The 
recording periods analysed for spiking in the 10 intact preparations were: during 
quiet, total time 729 s (mean ± s.d. 73 ± 58 s; range 17-196 s); and during
whisking, total time 376 s (mean ± s.d. 38 ± 20 s; range 16-71 s). In the 10 experiments
in which the nerve was cut, the recording periods analysed for spiking were:
during quiet, total time 922 s (mean ± s.d. 92 ± 57 s; range 15-170 s); and during
whisking, total time 515 s (mean ± s.d. 51 ± 26 s; range 15-98 s). For Fig. 1i,
every recorded whisking cycle during an experiment was aligned at the peak of 
protraction and averaged, revealing both the mean whisker movement during a 
whisking cycle and the mean change in membrane potential. For the example 
experiment with intact IONs a total of 329 whisking cycles were averaged, and for 
the example experiment with cut IONs a total of 457 whisking cycles were 
averaged. For Fig. 1j, modulation depth is defined as the difference in amplitude 
between the most positive and the most negative peak in the average membrane 
potential within ±100 ms from the peak of protraction. All recorded whisking
cycles were included in the average. For the 10 nerve intact preparations we
analysed a total of 1,802 whisking cycles (mean ± s.d. 180 ± 96; range 102-329).
For the 10 nerve cut preparations we analysed a total of 2,961 whisking
cycles (mean ± s.d. 296 ± 133; range 132-482).
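
The two measures defined in this paragraph, the area under the spectrum between 1 Hz and 5 Hz and the modulation depth of the cycle-averaged membrane potential, can be sketched as follows. This is an illustrative sketch only, not the analysis code used in the study: the function names, the direct discrete Fourier sum (used in place of an FFT library so that the example needs only the Python standard library) and the power normalization are assumptions.

```python
import cmath
import math


def band_power(vm, fs, f_lo=1.0, f_hi=5.0):
    """Area under the power spectrum of one Vm segment between f_lo and f_hi (Hz).

    vm is a list of samples (mV) and fs the sampling rate (Hz). The mean is
    removed first; the normalization of the spectrum is an assumption.
    """
    n = len(vm)
    mean = sum(vm) / n
    x = [v - mean for v in vm]
    df = fs / n  # frequency resolution of the segment
    k_lo = math.ceil(f_lo * n / fs - 1e-9)
    k_hi = math.floor(f_hi * n / fs + 1e-9)
    total = 0.0
    for k in range(k_lo, k_hi + 1):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        total += (abs(coeff) / n) ** 2
    return total * df  # rectangle-rule area over the selected bins


def modulation_depth(cycles, fs, window_ms=100.0):
    """Peak-to-peak amplitude of the cycle-averaged Vm near the alignment point.

    Each entry of cycles is a Vm snippet of equal length whose central sample
    is the peak of protraction; the search window is +/- window_ms.
    """
    n = len(cycles[0])
    avg = [sum(c[i] for c in cycles) / len(cycles) for i in range(n)]
    centre = n // 2
    half = int(round(window_ms * 1e-3 * fs))
    window = avg[max(0, centre - half): centre + half + 1]
    return max(window) - min(window)
```

For a 3-s segment the frequency resolution is 1/3 Hz, so the 1-5 Hz band spans 13 frequency bins regardless of the sampling rate.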

The mean value was subtracted and the amplitude of fluctuations was normal- 
ized to the standard deviation for each individual data segment before comput- 
ing the cross correlation” (Figs 2c, d and 3c, f-i). The cross-correlation value is 
the mean from −2 ms to +2 ms. For Fig. 2c, d, cross correlations were computed
across the central 3-s period of visually identified whisking episodes. In total, 40 
whisking episodes (range 1-8, mean 3.6) were analysed in Fig. 2d, with 11 
experiments combining whole-cell and local field potential recordings; in 9 of 
these experiments we additionally measured the EEG. For Fig. 3c, f-i, cross 
correlations were computed across the central 3-s period of visually identified 
whisking episodes with a total of 72 episodes (range 2-6, mean 3.8 whisking 
episodes per dual recording, total 216 s of recording). Cross correlations of quiet 
periods were computed from the same number of 3-s segments, with each 
analysed segment being adjacent to the whisking episode. For Fig. 3j, k, the open
source Chronux software (http://chronux.org/) was used under Matlab (The 
Mathworks) to generate the FFT and calculate coherence across the 19 dual 
recordings during quiet and whisking. 
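
The normalization and lag averaging described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the routine used in the study: the function names are hypothetical, the population standard deviation is used for normalization, and each lagged sum is divided by the full segment length.

```python
import math


def zscore(x):
    """Subtract the mean and divide by the (population) standard deviation."""
    n = len(x)
    mean = sum(x) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    return [(v - mean) / sd for v in x]


def xcorr_at_lag(a, b, lag):
    """Cross-correlation of two z-scored traces at an integer sample lag."""
    n = len(a)
    if lag >= 0:
        pairs = zip(a[: n - lag], b[lag:])
    else:
        pairs = zip(a[-lag:], b[: n + lag])
    return sum(p * q for p, q in pairs) / n


def zero_lag_correlation(x1, x2, fs, half_window_ms=2.0):
    """Mean normalized cross-correlation over lags of -2 ms to +2 ms."""
    a, b = zscore(x1), zscore(x2)
    max_lag = int(round(half_window_ms * 1e-3 * fs))
    values = [xcorr_at_lag(a, b, lag) for lag in range(-max_lag, max_lag + 1)]
    return sum(values) / len(values)
```

With this normalization, two identical traces give a value close to 1, inverted traces a value close to −1, and unrelated traces a value near 0, which is the scale on which the reported correlations (for example 0.72 during quiet periods and 0.33 during whisking) are expressed.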

For Fig. 4a, d, the average membrane potential of the spiking and nearby cell 
was calculated from all spikes isolated from another spike by >100 ms. The 
grand average represents the average from 7 cells. For Fig. 4e, the pre-spike 
depolarization was calculated as the change in Vm during the 20 ms before the
spiking cell reached threshold for action potential initiation. For Fig. 4f, the 
signal-to-noise ratio was calculated by dividing the pre-spike depolarization 
leading to action potential initiation by the variance during quiet and whisking 
periods observed in the same recording. 
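
The spike selection, pre-spike depolarization and signal-to-noise ratio described in this paragraph can be sketched as follows. This is a hedged illustration rather than the study's code: the function names are hypothetical, threshold-crossing times are assumed to be already detected, and the variance is the population variance of whichever Vm segment is passed in.

```python
def isolated_spikes(spike_times, min_gap=0.1):
    """Keep spike times (s, sorted) separated from every other spike by > min_gap."""
    kept = []
    last = len(spike_times) - 1
    for i, t in enumerate(spike_times):
        prev_ok = i == 0 or t - spike_times[i - 1] > min_gap
        next_ok = i == last or spike_times[i + 1] - t > min_gap
        if prev_ok and next_ok:
            kept.append(t)
    return kept


def pre_spike_depolarization(vm, fs, threshold_times, window=0.020):
    """Mean change in Vm (mV) over the window (s) before each threshold crossing."""
    w = int(round(window * fs))
    changes = []
    for t in threshold_times:
        i = int(round(t * fs))
        if i - w >= 0 and i < len(vm):
            changes.append(vm[i] - vm[i - w])
    return sum(changes) / len(changes)


def variance(x):
    """Population variance of a list of samples."""
    mean = sum(x) / len(x)
    return sum((v - mean) ** 2 for v in x) / len(x)


def signal_to_noise(psd, vm_segment):
    """Pre-spike depolarization divided by the variance of ongoing Vm."""
    return psd / variance(vm_segment)
```

Because the ratio divides a depolarization in mV by a variance in mV², a smaller Vm variance during whisking raises the ratio even when the pre-spike depolarization itself is unchanged.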

All data are presented as mean ± standard deviation. Student’s t-tests
(two-tailed, paired or unpaired assuming unequal variances) were performed in Excel
(Microsoft) to evaluate statistical significance. Unless otherwise stated, the
number of cells analysed is denoted by ‘n’.
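
For readers who want to reproduce the test statistics outside a spreadsheet, the two t-test variants named above can be sketched as follows. This is an assumption-labelled sketch: the function names are hypothetical, and only the t statistic and degrees of freedom are computed, because converting them to a two-tailed P value needs the Student t distribution, which the Python standard library does not provide.

```python
import math
import statistics


def welch_t(a, b):
    """Unpaired t statistic and Welch-Satterthwaite degrees of freedom.

    Corresponds to an unpaired test assuming unequal variances.
    """
    va = statistics.variance(a) / len(a)  # squared standard error of group a
    vb = statistics.variance(b) / len(b)  # squared standard error of group b
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def paired_t(a, b):
    """Paired t statistic and degrees of freedom for matched samples."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    t = statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))
    return t, n - 1
```

A paired test suits comparisons of quiet and whisking periods within the same recording, whereas the unequal-variance form suits comparisons between mice with intact and cut nerves.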

On the spontaneous emergence of cell polarity


Steven J. Altschuler, Sigurd B. Angenent, Yanqin Wang & Lani F. Wu


Diverse cell polarity networks require positive feedback for locally 
amplifying distributions of signalling molecules at the plasma mem- 
brane’. Additional mechanisms, such as directed transport” or 
coupled inhibitors**, have been proposed to be required for rein- 
forcing a unique axis of polarity. Here we analyse a simple model of 
positive feedback, with strong analogy to the ‘stepping stone’ model 
of population genetics’, in which a single species of diffusible, mem- 
brane-bound signalling molecules can self-recruit from a cytoplas- 
mic pool. We identify an intrinsic stochastic mechanism through 
which positive feedback alone is sufficient to account for the spon- 
taneous establishment of a single site of polarity. We find that the 
polarization frequency has an inverse dependence on the number of 
signalling molecules: the frequency of polarization decreases as the 
number of molecules becomes large. Experimental observation of 
polarizing Cdc42 in budding yeast is consistent with this prediction. 
Our work suggests that positive feedback can work alone or with 
additional mechanisms to create robust cell polarity. 

Many cell types can spontaneously establish and maintain asym- 
metric distributions of signalling molecules on the plasma mem- 
brane’®, even in the apparent absence of pre-existing cues”*. 
Positive feedback circuits, found at the core of diverse biological 
networks, enable signalling molecules localized at the plasma mem- 
brane to initiate processes that further accelerate localized recruit- 
ment. These processes allow signalling molecules, such as Cdc42 in 
budding yeast’, mPar3 and mPar6 in axons of hippocampal
neurons’°, and Rac in canine kidney cells'’ and human chemotaxing
neutrophils’, to be concentrated within a defined region of the 
plasma membrane. 

Spontaneous cortical polarization can occur through activation of 
positive feedback circuits that do or do not depend on mechanisms of 
directed transport*'*'*. Transport-dependent circuits make use of 
cytoskeletal structure to reinforce spatial asymmetries by directing 
signalling molecules to specified locations on the plasma mem- 
brane”’*'®. However, in the absence of directed transport, it is 
unclear how positive feedback circuits counteract lateral diffusion 
to establish and maintain an asymmetric distribution of signalling 
molecules*". Additional mechanisms have been proposed, such as 
coupled activation-dependent inhibition**’* or depletion of critical 
signalling components’. Because evidence for these other mechan- 
isms is less generally established, we wondered whether positive feed- 
back alone could generate cell polarization. 

The specific details of any given positive feedback circuit can be 
complex and cell-type specific. For example, signalling molecules can 
associate to the plasma membrane through docking with transmem- 
brane receptors, binding to phospholipid headgroups, or insertion of 
prenylated membrane anchors. Additionally, signalling molecules 
such as GTPases can assume multiple states, which in turn strongly 
affect properties of their transport. To discover general principles, as 
with other recent models of polarity'®'”-'’, we abstracted mechanisms 
common to these circuits and constructed a biologically motivated, 
yet mathematically tractable, model of positive feedback. 


Thus, we formulated a model of positive feedback in which sig- 
nalling molecules comprising a single species move between non- 
recruiting, cytoplasmic states and recruiting, plasma-membrane- 
bound states without mechanisms of directed transport (this model 
is referred to simply as ‘positive feedback’ in the remainder of the 
paper). The redistribution of signalling molecules is determined by 
the rates of four fundamental transport mechanisms (Fig. 1; see 
Methods): (1) recruitment (kfb) of cytoplasmic molecules to the
locations of membrane-bound signalling molecules; (2) spontaneous
association (kon) of cytoplasmic molecules to random locations on
the plasma membrane; (3) lateral diffusion (D) of molecules along
the membrane; and (4) random disassociation (koff) of signalling
molecules from the membrane. The model parameters kfb, kon, D
and koff, and total number of signalling molecules N, are readily
biologically interpretable, and may be estimated from experimental 
data (Methods). 

In an initial analysis of the dynamics of this positive feedback 
circuit, we ignored the spatial distribution of molecules on the mem- 
brane. We found that the fraction of total signalling molecules on the 
membrane can switch from zero to a final equilibrium heq, with a half
time of ~1/(heq kfb) (Supplementary Information). This equilibrium
fraction heq can be estimated simply as 1 − koff/kfb in the case when kon
is smaller than kfb (see Supplementary Information for general
formulation), and reflects a balance between the net fluxes of mole- 
cules to and from the membrane. Thus, we find that the strength of 
feedback strongly influences both the switching dynamics and the 
magnitude of the equilibrium state. 
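
The well-mixed dynamics described in this paragraph can be illustrated with a short numerical sketch. The rate equation used here, dh/dt = (kon + kfb h)(1 − h) − koff h for the membrane fraction h, is an assumed mean-field form chosen to match the four mechanisms listed above; it is not quoted from the paper, and the function names and parameter values are hypothetical.

```python
import math


def equilibrium_fraction(k_fb, k_off, k_on=0.0):
    """Non-negative steady state of the assumed rate equation.

    Solves k_fb*h**2 + (k_on + k_off - k_fb)*h - k_on = 0; with k_on = 0 and
    k_off < k_fb this reduces to 1 - k_off/k_fb.
    """
    b = k_on + k_off - k_fb
    return (-b + math.sqrt(b * b + 4.0 * k_fb * k_on)) / (2.0 * k_fb)


def integrate(k_fb, k_off, k_on, h0=0.0, dt=1e-3, t_end=50.0):
    """Forward-Euler integration of the assumed rate equation from h0."""
    h = h0
    for _ in range(int(round(t_end / dt))):
        h += dt * ((k_on + k_fb * h) * (1.0 - h) - k_off * h)
    return h
```

With kon set to zero, the assumed equation becomes dh/dt = kfb h (heq − h), a logistic form whose characteristic switching time is 1/(heq kfb), consistent with the half time quoted above. For example, `equilibrium_fraction(10.0, 9.0)` corresponds to 1 − 9/10.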

Figure 1 | A conceptual model of a positive feedback circuit is characterized
by five biologically interpretable parameters. Coloured arrows indicate
four main sources of molecular transport (see text: purple, spontaneous
membrane association or ‘input’ to positive feedback circuit, with rate kon;
black, spontaneous membrane disassociation, with rate koff; red, positive
feedback, with rate kfb; blue, lateral diffusion, with rate D). Not shown is N,
the total number of signalling molecules.

We next considered how signalling molecules are distributed spatially
on the plasma membrane. In the case when the number of
signalling molecules is large, analysis reveals that all spatial variation
eventually vanishes; membrane densities converge exponentially to a
spatially homogeneous equilibrium state with membrane fraction heq
(Methods). Furthermore, perturbations of this state due to spontaneous
association events quickly return to equilibrium (Fig. 2a; see
Methods). Thus, no polarity can persist, regardless of the feedback 
strength kfb. This suggests that for large numbers of molecules, positive
feedback requires additional biological mechanisms to overcome
diffusion in order to maintain a persistent switch-like
distribution in space. This phenomenon has been studied rigorously 
for other positive feedback systems, including many motivated by 
theoretical ecology”. 

In contrast, for small numbers of molecules, unexpected model
behaviour emerges as stochastic fluctuations can drive cellular behaviours
from stable equilibrium to dynamic, non-stable states. As
the number of signalling molecules decreases, the expected equilibrium
fraction of molecules on the membrane remains h_eq, with fluctuations
on the order of 1/√N. However, we observed that with
decreased numbers of signalling molecules, distinct polarized regions
began to emerge spontaneously (Fig. 2b). Additionally, we observed
that this behaviour is strongly affected by the ratio k_on/k_fb of spontaneous
membrane association to feedback (Fig. 2c and Supplementary
Fig. 1). When this ratio is large, frequent association of molecules to
random membrane locations results in a spatially homogeneous density
(Fig. 2c, first and last 10 min). When this ratio is small, spontaneous
association events are rare and local amplification by positive
feedback frequently results in the emergence of spatially isolated clusters
of signalling molecules (Fig. 2c, middle 40 min). We note that
simulations were performed using h_eq = 0.1 (Methods), hence
the observed polarization behaviour is not due to depletion of the
cytosolic pool (Supplementary Materials). Thus, these observations
predict that decreased numbers of signalling molecules,
decreased spontaneous membrane association rate, or increased positive
feedback strength can lead to the spontaneous emergence of
polarity (Box 1).

Mathematical analysis of the positive feedback system shows that
transitions among clustering behaviours are entirely determined by a
simple relationship between the ratio k_on/k_fb and the number of
signalling molecules N: a single cluster will certainly form for small
k_on/k_fb ≪ N^-2 (Fig. 3a, region 1), whereas no clusters will form for
large k_on/k_fb ≫ (N^-1 ln N)^(1/2) (Fig. 3a, region 2). Numerical simulations
indicate that the transition occurs for k_on/k_fb ~ N^-1 (Fig. 3a,
region 3). Cluster formation is possible for large numbers of molecules,
but the larger the value of N, the smaller the value of k_on/k_fb for
which clustering actually occurs. This phase plane portrait reveals
that polarization is a robust behaviour that can be tuned by a small
number of parameters.
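
The regimes above can be sketched as a small lookup, assuming the two bounds read k_on/k_fb ≪ N^-2 (single cluster) and k_on/k_fb ≫ (N^-1 ln N)^(1/2) (no clusters). Treating these asymptotic bounds as sharp cut-offs is a simplification made only for illustration, and the function name and return labels are hypothetical.

```python
import math


def clustering_regime(k_on, k_fb, n_molecules):
    """Classify a parameter set against the two asymptotic bounds.

    Assumption: the bounds are used as hard thresholds, although the
    theory states them only as 'much smaller' / 'much larger' conditions.
    """
    if n_molecules < 2:
        raise ValueError("n_molecules must be at least 2")
    ratio = k_on / k_fb
    lower = float(n_molecules) ** -2                        # N^-2
    upper = math.sqrt(math.log(n_molecules) / n_molecules)  # (ln N / N)^(1/2)
    if ratio < lower:
        return "region 1: single cluster"
    if ratio > upper:
        return "region 2: no clusters"
    # Simulations place the actual transition near ratio ~ 1/N (region 3).
    return "intermediate: transition expected near k_on/k_fb ~ 1/N"


if __name__ == "__main__":
    for k_on in (1e-6, 1e-2, 5.0):
        print(k_on, clustering_regime(k_on, k_fb=10.0, n_molecules=1000))
```

Because N^-2 shrinks faster than N^-1, the same k_on/k_fb can fall in region 1 for small N and leave it as N grows, which matches the statement that clustering requires ever smaller k_on/k_fb at larger N.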

A clear prediction of this theory is that the probability of spontaneous
polarization decreases as the number of signalling molecules
becomes large (Fig. 3a, follow any horizontal line from left to right).
To test whether this trend could be observed experimentally, we
made use of an established budding yeast system in which an actin-independent
core positive feedback circuit has been shown to be

Figure 2 | Numerical simulations reveal an inverse dependence between
polarization frequency and large numbers of signalling molecules.
a, Positive feedback alone leads to homogeneous steady-state solutions for
large numbers of signalling molecules (Methods). Inset: evolution of total
membrane fraction h(t); theoretical equilibrium value h_eq = 0.1. a.u.,
arbitrary units. b, Decreasing numbers of signalling molecules lead to
increasing levels of spatial segregation. Shown are kymographs from
simulations (y axis, membrane position; x axis, time). Particle density is
rescaled to units of equilibrium fraction (colour bar). c, Small k_on/k_fb can
lead to the establishment of a dominant clan (Box 1). Clan colour
assignment in the kymograph is random, and is reset for each molecule after
membrane disassociation. Model parameters are as in Methods.

Box 1 | Intuition for particle clustering


Intuition for the clustering behaviour of positive feedback circuits for 
small numbers of molecules can be obtained through the simple 
concept of lineage tracking (Fig. 2c). The basic idea is to track the 
location and source of recruitment for all signalling molecules on the 
membrane. Signalling molecules that spontaneously associate with the 
membrane become the founders of new ‘clans’. Molecules recruited to 
the membrane join the clans of their recruiters. Molecules that leave 
the membrane to re-enter the cytoplasmic pool lose their clan identity. 
A clan that has lost all its members becomes extinct. 

Initially each clan has population size one, but over time some clans 
grow whereas others become extinct. Hence the number of clans on 
the membrane decreases between occurrences of spontaneous on- 
events. At equilibrium the expected total number of signalling 
molecules at the membrane remains a (non-zero) constant. If the time 
between spontaneous on-events is sufficiently long, then the number 
of clans may decrease so much that all molecules on the membrane 
belong to a single remaining clan. For small diffusion rates, all 
molecules on the membrane will be located near their most recent 
common ancestor, resulting in the formation of a single, high-density 
cluster. (Our work bears close resemblance to the well studied 
‘stepping stone’ model introduced to study spatially heterogeneous
theories of population genetics and dynamics. Clans in the stepping 
stone model can be more literally interpreted as collections of 
individuals sharing a common genetic trait by descent; new mutations 
create new clans, genetic drift increases or decreases clan sizes, and 
migration disperses clan members among neighbouring colonies.) 

How long does it typically take for a single cluster to form at 
equilibrium? For relatively small membrane association rates we can 
deduce that the expected number of clans after waiting any time t is 
approximately equal to h_eq N/(k_off t) (Supplementary Information).
Thus, after a time interval of t ~ h_eq N/k_off, a single clan is expected if no
further spontaneous membrane association occurs. Simulations 
indicate, however, that a single large clan can form even if a small 
number of spontaneous associations occur; most new clans rapidly 
become extinct while the dominant clan is forming. Diffusion 
eventually spreads apart the members of a dominant clan. However, at 
any given time, lineage tracking can be restarted with all original clan 
identity ‘erased’ and each membrane-bound molecule regarded as a 
new clan founder. Then, our analysis shows that after a short time a 
dominant clan will again emerge. Thus, positive feedback alone, 
operating on a small number of particles, has the capacity to 
recurrently generate polarity. 
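
The lineage-tracking idea in this box can be illustrated with a short simulation. The sketch below is a non-spatial simplification and not the authors' implementation: it ignores diffusion and position, tracks only the clan label of each membrane-bound molecule, and uses the three event types of the stochastic model (spontaneous association, recruitment, dissociation). The function name, the seed and the parameter values in the usage example are assumptions chosen for illustration.

```python
import random


def simulate_clans(n_total, k_on, k_fb, k_off, n0, t_end, seed=0):
    """Gillespie-style lineage tracking without space.

    Returns the clan label of every membrane-bound molecule at t_end.
    Each of the n0 initial membrane molecules founds its own clan.
    """
    rng = random.Random(seed)
    clans = list(range(n0))  # clan label per membrane-bound molecule
    next_id = n0
    t = 0.0
    while True:
        n = len(clans)
        r_on = k_on * (n_total - n)                  # new founder appears
        r_fb = k_fb * (n / n_total) * (n_total - n)  # recruitment into a clan
        r_off = k_off * n                            # a molecule leaves
        total = r_on + r_fb + r_off
        if total == 0.0:
            break  # empty membrane and no spontaneous association
        t += rng.expovariate(total)
        if t > t_end:
            break
        x = rng.random() * total
        if x < r_on:
            clans.append(next_id)
            next_id += 1
        elif x < r_on + r_fb:
            # the recruit joins the clan of a randomly chosen recruiter
            clans.append(clans[rng.randrange(n)])
        else:
            # remove a random molecule (swap with last, then pop)
            i = rng.randrange(n)
            clans[i] = clans[-1]
            clans.pop()
    return clans


if __name__ == "__main__":
    final = simulate_clans(n_total=1000, k_on=0.0, k_fb=10.0, k_off=9.0,
                           n0=100, t_end=150.0, seed=1)
    print("molecules on membrane:", len(final))
    print("surviving clans      :", len(set(final)))
```

With k_on set to zero no new clans are founded, so the number of surviving clans can only fall; running well beyond t ~ h_eq N/k_off should typically leave at most one clan, in line with the estimate above.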


sufficient to establish cue-independent, spontaneous polarization of
the GTPase Cdc42 (refs 8, 9, 14). Mechanistic details of this positive
feedback circuit are not well understood; however, several lines of
evidence indicate its relevance as an experimental model for our
current study: (1) self-recruitment of Cdc42 at the plasma membrane
(via its interactions with the guanine nucleotide exchange factor
(GEF) Cdc24 and the adaptor protein Bem1) does not require directed
transport; (2) polarization is maintained in a state of
dynamic equilibrium, with all components rapidly exchanging
between the plasma membrane and a significant cytoplasmic pool;
and (3) sites of polarization have been observed occasionally to drift
and flicker, in phenomenological agreement with our model simulations.

This positive feedback circuit for Cdc42 can be grossly described by
the four fluxes in our model. Model parameter values can be estimated
from experimental data (Methods), and reflect the combined
effects of presumed unmodelled mechanistic interactions, such as those
required for self-recruitment (for example, via Bem1 and Cdc24 (refs
8, 9, 14)) and reversible membrane association (for example, via
guanine nucleotide disassociation inhibitor (GDI)). For varying
numbers of signalling molecules, probabilities of polarization can be
computed using numerical simulation. As expected from the phase
plane diagram (Fig. 3a), the probability of spontaneous polarization
decreases as the number of signalling molecules becomes large
(Fig. 3b, top histogram, N > 1,000). We note that as the number
of signalling molecules becomes very small the overall polarization
percentage drops sharply (Fig. 3a, region 4, and Fig. 3b, top histogram,
N < 500), reflecting more frequent loss of polarity due to spontaneous
disassociation of all membrane-bound particles and longer
waiting times for re-establishment (Supplementary Fig. 2).

We next assayed polarization of green fluorescent protein (GFP)-tagged
Cdc42 in yeast cells treated with the drug latrunculin A to
inactivate an alternative actin-dependent feedback circuit
(Methods). To quantify the effects of varying N, cells were binned
into different categories on the basis of background-subtracted, total
GFP-Cdc42 intensities (Supplementary Fig. 3; see Methods). Within
each category we computed the fraction of polarized cells (Fig. 3b). In
general agreement with the model prediction, we observed a significant
decrease in the frequency of polarization with increasing Cdc42-GFP
expression (Fig. 3b, bottom histogram). Notably, this inverse
correlation is opposite to what was modelled for transport-dependent
positive feedback of constitutively activated Cdc42 in budding
yeast. In the case of actomyosin-based feedback, increased concentrations
of constitutively activated Cdc42 led to increased frequency
of establishing and maintaining distinct sites of cortical polarization.
Thus, with respect to increasingly large numbers of signalling
molecules, models suggest opposite trends in the frequency of polarization
for positive feedback circuits that do and do not make use of
directed feedback.

The intrinsic ability of positive feedback to polarize spontaneously
without requiring additional mechanisms is not apparent from

Figure 3 | Dependence of polarization on model parameters. a, Illustration
of phase-plane portrait. Theory reveals robust parameter regimes for cluster
formation (Supplementary Information, Summary): 1, polarization; 2, no
polarization; 3, transition zone; 4, too few signalling molecules for
polarization (Supplementary Fig. 2). b, Frequency of observing Cdc42
polarization in yeast for increasing numbers of molecules (N). Shown are
polarization frequencies estimated from numerical simulations (top panel)
and experimental observation of yeast cells expressing GFP-Cdc42 (middle
panel) (Methods and Supplementary Fig. 3). Green error bars represent
standard error from four independent replicates. Bottom panel: examples of
polarized and unpolarized yeast cells. Arrows point to polarized regions.
Scale bar, 1.9 μm.

established deterministic models of pattern formation
(Supplementary Information). Our work suggests that positive feed- 
back can act alone at the core of complex signalling networks with 
small numbers of molecules to create and maintain a highly localized 
cortical distribution required for activation of downstream biological 
processes. 


METHODS SUMMARY 

Mathematical models. Deterministic and stochastic formulations of the model 
are given in Methods and Supplementary Information. 

Numerical simulations. Simulations were performed using parameter values
estimated from published experiments (Methods): D ~ 1.2 μm² min^-1;
N ~ 10^3; h_eq = 0.1; k_off = 9 min^-1; k_fb = 10 min^-1; and k_on as indicated in each
figure. Numerical simulations were carried out on one-dimensional circles.
Experimental assay. The presence of GFP-Cdc42 polar caps was assayed in
yeast as previously described (Supplementary Fig. 3). GFP-Cdc42 intensity
scores for each cell region were computed using ImageJ, and probabilities of cap
formation computed using custom Matlab software.


Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature.

METHODS

Deterministic formulation of model. The fraction h(t) of all molecules on the
membrane changes in time t according to the ordinary differential equation:

$$\frac{dh}{dt} = k_{\mathrm{on}}(1-h) + k_{\mathrm{fb}}(1-h)h - k_{\mathrm{off}}h$$

The density of membrane-bound molecules u(x,t) evolves in time according to
the partial differential equation:

$$\frac{\partial u}{\partial t} = D\nabla^{2}u + k_{\mathrm{on}}(1-h) + k_{\mathrm{fb}}(1-h)u - k_{\mathrm{off}}u$$

where ∇² is the diffusion operator on the plasma membrane.
Stochastic formulation of model. In the stochastic model we consider N molecules
in a cell, n(t) of which are on the membrane at any given time. The molecules
on the membrane have positions x_1(t), ..., x_{n(t)}(t), which undergo Brownian
motion with diffusion coefficient D. The number of molecules on the membrane
can also change randomly in three different ways: (1) a molecule on the membrane
may dissociate itself from the membrane (rate constant k_off); (2) a molecule
on the membrane may ‘recruit’ some molecule in the cytosol to its location on the
membrane (rate constant k_fb); (3) a molecule from the cytosol may spontaneously
associate itself to some random location on the membrane (rate constant k_on).
The probability p_n(t) of n molecules appearing on the membrane at
time t changes according to the master equation:

$$\frac{dp_n(t)}{dt} = W_n - W_{n-1}$$

where

$$W_n = k_{\mathrm{off}}(n+1)\,p_{n+1} - \left(k_{\mathrm{on}} + k_{\mathrm{fb}}\,n/N\right)(N-n)\,p_n$$

while the number n(t) of molecules on the membrane evolves by the Markov process
determined by the master equation. See Supplementary Information for a more detailed
description of the stochastic model.
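
A minimal sketch of the Markov process for n(t), ignoring molecule positions, is given below. It uses a standard Gillespie (stochastic simulation) scheme with the up-rate (k_on + k_fb n/N)(N − n) and down-rate k_off n taken from the master equation; the function name, seed and the parameter values in the usage example are illustrative assumptions, not the authors' code.

```python
import random


def simulate_membrane_count(n_total, k_on, k_fb, k_off, n0, t_end, seed=0):
    """Simulate n(t) and return (final n, time-averaged membrane fraction)."""
    rng = random.Random(seed)
    n, t = n0, 0.0
    area = 0.0  # time integral of n(t)
    while t < t_end:
        up = (k_on + k_fb * n / n_total) * (n_total - n)  # n -> n + 1
        down = k_off * n                                   # n -> n - 1
        total = up + down
        if total == 0.0:
            area += n * (t_end - t)  # absorbing state: nothing can happen
            break
        dt = rng.expovariate(total)
        if t + dt > t_end:
            area += n * (t_end - t)
            break
        area += n * dt
        t += dt
        if rng.random() * total < up:
            n += 1
        else:
            n -= 1
    return n, area / (t_end * n_total)


if __name__ == "__main__":
    n_final, mean_fraction = simulate_membrane_count(
        n_total=1000, k_on=0.01, k_fb=10.0, k_off=9.0,
        n0=100, t_end=200.0, seed=2)
    print("final n:", n_final, "time-averaged fraction:", mean_fraction)
```

With k_off = 9 min^-1 and k_fb = 10 min^-1 the time-averaged fraction is expected to stay near 0.1, fluctuating around that value rather than settling exactly on it.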

Numerical simulations. Whereas for the theoretical analysis the membrane may
be a one-dimensional curve or a two-dimensional surface of arbitrary geometry,
numerical simulations were carried out on one-dimensional circles for ease of
visualization. Simulation results shown in the figures are based on model parameter
ranges estimated from experimental measurements of polarized Cdc42 in
budding yeast. For a yeast cell of radius R ~ 5 μm, the plasma membrane
diffusion constant of Cdc42 was estimated to be D = 0.001
(2πR)² min^-1 ~ 1.2 μm² min^-1 (= 5 × 10^-10 cm² s^-1). We assumed total
Cdc42 numbers on the order of N ~ 10^3 with plasma membrane fraction
h_eq ~ 0.1. Fluorescence recovery after photobleaching (FRAP) analysis of
half-time recovery for polarized Cdc42 (as well as Cdc24 and Bem1) was on the
order of ~5 s (ref. 8); thus we estimated an off-rate of k_off = (ln 2)/T_1/2
~ 9 min^-1. A feedback strength of k_fb = 10 min^-1 was used to obtain an
equilibrium plasma membrane fraction h_eq = (1 − k_off/k_fb) ~ 0.1. On-rate values
k_on were varied as indicated in each figure legend. Clustering is predicted to occur
for k_on < k_fb/(h_eq(1 − h_eq)N²) ~ 10^-4 min^-1, although simulations indicate
that clustering can even occur for k_on less than 10° ° (Fig. 3b and
Supplementary Information). Polarization in simulations was determined by
whether an interval of 10% of the membrane contained more than 50% of total
membrane content; error bars represent standard errors from 4 replicates of
1,000 independent simulations (Fig. 3b, top panel).
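
The polarization criterion stated above (an interval covering 10% of the membrane that holds more than 50% of the membrane content) can be sketched for particles on a circle as follows. Positions are assumed to be normalized to a circumference of 1, the interval is treated as closed, and the function name is hypothetical; this is an illustration of the criterion, not the authors' analysis code.

```python
def is_polarized(positions, window=0.10, threshold=0.50):
    """Return True if some arc of length `window` on the unit circle
    contains more than `threshold` of all particles."""
    n = len(positions)
    if n == 0:
        return False
    pts = sorted(p % 1.0 for p in positions)
    # Append a shifted copy so that arcs wrapping past 1.0 are handled.
    ext = pts + [p + 1.0 for p in pts]
    best = 0
    j = 0
    for i in range(n):
        if j < i:
            j = i
        # Advance j while points still lie within the arc starting at ext[i].
        while j < i + n and ext[j] - ext[i] <= window:
            j += 1
        best = max(best, j - i)
    return best > threshold * n


if __name__ == "__main__":
    clustered = [0.95, 0.97, 0.99, 0.01, 0.03, 0.50]  # cluster wraps around 0
    uniform = [i / 10 for i in range(10)]
    print(is_polarized(clustered), is_polarized(uniform))
```

Only arcs that start at a particle need to be checked, because any arc holding a maximal number of particles can be slid forward until its start coincides with one of them without losing any.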

Release assay. Cells from yeast strain RLY1948 (a gift from R. Li), which contain
inducible GFP-Cdc42 expression under the Gal1/10 promoter, were grown
logarithmically in synthetic complete (SC) medium without methionine.
To arrest cells in G1, cells were washed three times with autoclaved water,
resuspended into YP medium supplemented with 2 mM methionine and 2%
raffinose and distributed into four tubes. Cells were then cultured for 150 min
(tube 1), 135 min (tube 2), 120 min (tube 3) and 90 min (tube 4). To induce
expression of GFP-Cdc42, 2% galactose was added to each tube and cells grown
for an additional 30 min (tube 1), 45 min (tube 2), 60 min (tube 3) and 90 min
(tube 4). To release from G1 arrest, cells were washed three times with water and
resuspended in methionine-free SC medium containing 2% glucose plus 150 μM
latrunculin A. The released cells were seeded in glass-bottomed microwell dishes
(MatTek) and coated with concanavalin A for 10 min before imaging.
Microscopy. Live cell microscopy was performed on a Nikon epifluorescence 
microscope (Nikon TE-2000E2) equipped with a 100× plan apo objective, a
cooled CCD camera (model COOLSNAP HQ; Photometrics Instruments) and 
a temperature control box set at 30°C. Image acquisition was performed with 
Metamorph software (Universal Imaging Corp.) with 1 × 1 binning. Images
were taken in one focal plane at different locations in the dishes. For each 
condition, images were taken during a 30 min period. 

Image analysis. Six images from each condition were selected for analysis yield- 
ing 145, 243, 211 and 228 individual cells, respectively. Using ImageJ, we manu- 
ally identified cell regions and computed a background-subtracted, averaged 
GFP-Cdc42 intensity score. The presence of a Cdc42 polar cap in each cell 
was assessed (Supplementary Fig. 3), and probabilities of cap formation computed
using custom Matlab software.

Drosophila RNAi screen identifies host genes
important for influenza virus replication 


Linhui Hao, Akira Sakurai, Tokiko Watanabe, Ericka Sorensen, Chairul A. Nidom, Michael A. Newton,
Paul Ahlquist & Yoshihiro Kawaoka


All viruses rely on host cell proteins and their associated mecha- 
nisms to complete the viral life cycle. Identifying the host mole- 
cules that participate in each step of virus replication could pro- 
vide valuable new targets for antiviral therapy, but this goal may 
take several decades to achieve with conventional forward genetic 
screening methods and mammalian cell cultures. Here we describe 
a novel genome-wide RNA interference (RNAi) screen in 
Drosophila that can be used to identify host genes important
for influenza virus replication. After modifying influenza virus 
to allow infection of Drosophila cells and detection of influenza 
virus gene expression, we tested an RNAi library against 13,071 
genes (90% of the Drosophila genome), identifying over 100 for 
which suppression in Drosophila cells significantly inhibited or 
stimulated reporter gene (Renilla luciferase) expression from an 
influenza-virus-derived vector. The relevance of these findings to 
influenza virus infection of mammalian cells is illustrated for a 
subset of the Drosophila genes identified; that is, for three impli- 
cated Drosophila genes, the corresponding human homologues 
ATP6V0D1, COX6A1 and NXF1 are shown to have key functions 
in the replication of H5N1 and H1N1 influenza A viruses, but not
vesicular stomatitis virus or vaccinia virus, in human HEK 293 
cells. Thus, we have demonstrated the feasibility of using 
genome-wide RNAi screens in Drosophila to identify previously 
unrecognized host proteins that are required for influenza virus 
replication. This could accelerate the development of new classes 
of antiviral drugs for chemoprophylaxis and treatment, which are 
urgently needed given the obstacles to rapid development of an 
effective vaccine against pandemic influenza and the probable 
emergence of strains resistant to available drugs. 

Influenza, a highly contagious disease of birds and mammals, is 
caused by negative-strand RNA viruses of the family Orthomyxo- 
viridae. Influenza outbreaks kill millions of people worldwide during 
pandemic years and hundreds of thousands during other years. Since 
their first lethal infection of humans in 1997, H5N1 influenza A 
viruses have spread throughout Asia and to Europe and Africa, posing
a major risk for a new influenza pandemic. To provide rational
bases for improved treatment and control of influenza virus infection,
we sought to advance understanding of viral infection mechanisms
by elucidating previously unknown virus-host cell
interactions. Many steps in the viral life cycle, including intracellular 
trafficking, gene expression, replication and virion assembly, depend 
on interactions with specific host cell gene products. Although most 
such host molecules remain elusive, emerging results indicate that
their identification and characterization can provide new insights
into the mechanisms by which viruses complete their life cycle, and
hence illuminate potentially valuable targets for prophylactic and
therapeutic intervention.

Systematic, genome-wide RNAi analysis offers an exciting tool to 
identify host genes that function in viral replication. Such analysis is 
facilitated by well-developed model systems such as Drosophila, the 
genome of which contains only ~14,000 genes, nearly all of which 
can be specifically targeted for high efficiency messenger RNA depletion
by double-stranded RNA (dsRNA) libraries. Because of its
powerful genetics and conservation with vertebrates, Drosophila
has been used to make numerous critical contributions to mammalian
cell biology. Thus, in principle, Drosophila RNAi studies
could accelerate identification of host interactions essential for influ- 
enza virus replication. 

Because Drosophila D-Mel2 cells do not express the human influenza
virus receptor α2,6-linked sialic acid (Supplementary Fig. 1), we
predicted that wild-type human influenza virus would not be able to 
infect them. Indeed, we did not detect viral protein expression by 
immunofluorescence assays in Drosophila D-Mel2 cells inoculated 
with influenza virus A/WSN/33 (WSN; H1N1) (data not shown). 
To bypass this block to wild-type influenza virus entry, we generated 
a genetically modified virus, Flu-VSV-G-GFP (FVG-G), in Madin- 
Darby canine kidney (MDCK) cells by replacing the receptor-bind- 
ing haemagglutinin (HA) and neuraminidase (NA) genes with genes 
encoding vesicular stomatitis virus glycoprotein G (VSV-G) and 
enhanced green fluorescence protein (eGFP), respectively
(Fig. 1a). Because the envelopes of the resulting virions bear
VSV-G, which mediates entry into mammalian, Drosophila and other
cells, FVG-G virions should readily infect Drosophila cells. Twenty-
four hours after infection, GFP fluorescence was detected in FVG-G- 
infected Drosophila D-Mel2 cells (Supplementary Fig. 2). We also 
confirmed by real-time polymerase chain reaction (PCR) that influ- 
enza virus RNA replication occurred in Drosophila cells 
(Supplementary Fig. 3). However, Drosophila cells infected with 
FVG-G did not release detectable virions into the medium, as assayed 
by infectivity tests on MDCK cells and by electron microscopy (data 
not shown). This was due, at least in part, to failure of Drosophila cells 
to express some viral proteins required for virion assembly and infec- 
tivity (Supplementary Fig. 4). Thus, Drosophila cells can support 
influenza virus replication from post-entry to at least the protein 
expression phase of the viral life cycle. This span encompasses multiple
other steps in the life cycle, including cytoplasmic release of
genomic RNA-containing viral ribonucleoprotein complexes
(vRNPs), vRNP import into the nucleus, mRNA synthesis from the
negative-strand viral RNA genome, mRNA export to the cytoplasm 
and translation. 

For high-throughput, functional genomics analysis of influenza 
virus replication in Drosophila cells, we engineered Flu-VSV-G- 
R.Luc (FVG-R), in which VSV-G and Renilla luciferase genes 
replaced the viral HA and NA open reading frames (Fig. 1b). FVG- 
R virions were then used with an RNAi library (Ambion) against 
13,071 Drosophila genes (~90% of all genes) to identify host genes 
affecting influenza-virus-directed Renilla luciferase expression 
(Fig. 1c). Two independent tests of the entire library were performed 
(Supplementary Table 1). For 176 genes for which dsRNAs inhibited 
FVG-R-directed luciferase expression in both replicates, repeated 
secondary tests using alternate dsRNAs to control for possible off- 
target effects confirmed the effects of 110 genes (Supplementary 
Tables 2 and 3). This confirmation rate is comparable to that in a 
Drosophila screen with a natural Drosophila-infecting virus.
Cell viability testing identified six genes with potentially significant 


[Figure 1 panel labels. a, b: incorporation signal of HA segment; incorporation signal of NA segment. c: add dsRNA of the Drosophila RNAi library (targeting 13,071 Drosophila genes) to each well of 384-well microplates; add DL1 cells to the plates; infect with FVG-R virus; measure Renilla luciferase activity to assess the efficiency of virus replication.]

Figure 1 | Overview of genome-wide RNAi screen to identify host factors
involved in influenza virus replication in Drosophila cells. a, b, Schematic
diagrams showing recombinant influenza viruses. Shown are FVG-G, in
which genes encoding the HA and NA proteins were replaced with the VSV-G
and eGFP genes, respectively (a), and FVG-R, in which the genes encoding
the HA and NA were replaced with the VSV-G and Renilla luciferase genes,
respectively (b). c, Schematic diagram of the systematic analysis of host
genes affecting influenza virus replication and gene expression in Drosophila
cells. Experimental details are given in Methods.

cytotoxic effects; these were excluded from further consideration
(Supplementary Information and Supplementary Table 3). 
Secondary tests of candidate genes for which dsRNAs increased 
FVG-R-directed luciferase expression produced a much lower con- 
firmation rate, suggesting a higher rate of off-target or other false- 
positive effects in this class (Supplementary Information and 
Supplementary Table 4). 

Among the over 100 candidate genes found to be important for
influenza virus replication in Drosophila cells, we selected the human
homologues of several encoding components in host pathways/machineries
that are known to be involved in the life cycle of influenza
virus, for example, ATP6V0D1 (endocytosis pathway), COX6A1
(mitochondrial function) and NXF1 (mRNA nuclear export
machinery), for further analysis in mammalian cells to assess the
relevance of our Drosophila results. ATP6V0D1 encodes subunit
D of vacuolar (H+)-ATPase (V-ATPase), a proton pump that functions
in the endocytosis pathway (that is, the acidification and fusion
of intracellular compartments). COX6A1 encodes a subunit of cytochrome
c oxidase (COX), an enzyme of the mitochondrial electron
transport chain that catalyses electron transfer from cytochrome c to
oxygen. NXF1 encodes a nuclear export factor critical for exporting
most cellular mRNAs containing exon-exon junctions.

As a first test for the possible contribution of these gene products 
to influenza virus replication in mammalian cells, we treated human 
HEK 293 cells twice at 24-h intervals with short interfering RNAs 
(siRNAs; siGENOME, Dharmacon) against the human homologue 
of each selected Drosophila gene. Twenty-four hours after the second
siRNA treatment, the cells were infected with FVG-R virus and, two 
days later, Renilla luciferase activity was measured to assess viral 
replication and gene expression. siRNA against ATP6V0D1 or
COX6A1 markedly decreased Renilla luciferase activity (Fig. 2a), 
but not cell viability (Supplementary Fig. 5a), suggesting that these 
genes have important roles in influenza virus replication in mam- 
malian cells, as in Drosophila cells. Inhibition was not caused by off- 
target effects because, for each gene, each of four distinct siRNAs 
inhibited FVG-R-directed expression of Renilla luciferase 
(Supplementary Table 6). Because COX6A1 encodes a subunit of
mitochondrial electron transport chain complex IV, COX, we used 
specific inhibitors to test whether in HEK 293 cells influenza virus 
also required other complexes in this chain (Fig. 2c). Inhibitors of 
complexes III, IV and V selectively inhibited FVG-R-directed Renilla 
luciferase expression by 50- to 100-fold, whereas complex I and II 
inhibitors had little or no effect. Thus, in mammalian cells, influenza 
virus depends on multiple late stages but not early stages in the 
mitochondrial electron transport chain. 

Treatment for four days with siRNA against NXF1 decreased
mammalian cell viability (data not shown), as predicted by the criti- 
cal role of NXF1 in general host cell metabolism. Accordingly, the 
total incubation time with siRNA against NXF1 was shortened to 36 h 
by transfecting cells with the siRNA twice at a 12-h interval, infecting 
with FVG-R virus 12 h later, and assaying for Renilla luciferase at 12-h 
post-infection. Under these conditions, cell viability was not detect- 
ably affected (Supplementary Fig. 5b) whereas Renilla luciferase 
activity was reduced by nearly fivefold (Fig. 2b). Whereas recent 
results indicated that influenza virus protein NS1 binds to NXF1 to
inhibit host mRNA export, these results imply that influenza virus
RNAs and/or proteins are transported by an NXF1-dependent path- 
way (see also Supplementary Information). 

To test the effects of these genes on authentic influenza viruses, we 
infected siRNA-treated HEK 293 cells with WSN virus or H5N1 influ- 
enza A/Indonesia/7/05 (Indonesia 7; isolated from a patient) or with 
VSV or vaccinia virus as controls. Progeny viruses were collected from 
the medium at 24h (Indonesia 7, VSV or vaccinia virus) or 48h 
(WSN) post-infection and were titrated. Depleting ATP6V0D1 and
COX6A1 did not affect VSV or vaccinia virus replication, but 
decreased the WSN and Indonesia 7 virus yields by ~10-fold or more 
(Fig. 3a). Thus, ATP6V0D1 and COX6A1 are required for replication
of influenza viruses but not VSV and vaccinia virus. Depleting the
NXF1 nuclear export factor, again using an accelerated 36-h time line, 
reduced Indonesia 7 virus titres by 20-fold (Fig. 3b), indicating that 
NXF1 has a critical role in H5N1 influenza virus replication. 
Unfortunately, because WSN grows in HEK 293 cells more slowly than 
does Indonesia 7 virus, the WSN virus yield at 12-h post-infection was 
insufficient to test for an effect of NXF1 depletion (data not shown). In
contrast to its effect on Indonesia 7 virus, NXF1 depletion had no effect on
VSV or vaccinia virus yields (Fig. 3b). Moreover, siRNA against NXF1 
inhibited FVG-G, but not adenovirus, which, like influenza virus, 
depends on nuclear steps for genome transcription and replication 
(Supplementary Fig. 6). Thus, multiple genes identified in Drosophila 
cells correspond to important, selective host factors for influenza virus 
replication in mammalian cells. 

The method that we have established, using systematic analysis of a
Drosophila RNAi library with confirmation in mammalian cells, can 
be used to identify host gene products that affect influenza virus 
replication. This utility was demonstrated in experiments with 
authentic influenza viruses, including an H5N1 virus isolated from 
a human patient. One of the candidate genes tested in mammalian 
cells, COX6A1, which encodes a subunit of COX, acts as a critical 
enzyme in cytochrome-c-dependent electron transport in mitochon- 
dria. Influenza virus PB2 polymerase has a mitochondrial targeting 
signal, and PB1-F2, the second protein encoded by the influenza
PB1 gene, was reported to localize to inner and outer mitochondrial 
membranes and to delay influenza virus clearance by host antiviral 
responses’*. Thus, COX6A1 may be involved in PB2- and/or PB1-F2-mediated 
functions in mitochondria. Additionally, influenza vRNP 

Figure 2 | Effect of selected siRNAs and inhibitors on Renilla luciferase 
expression in FVG-R-infected human cells. a–c, Renilla luciferase activity 
was measured in FVG-R-infected HEK 293 cells treated with siRNAs against 
ATP6V0D1 and COX6A1 (a), NXF1 (b) or the indicated mitochondrial 
electron transport chain inhibitors (c). Inhibitors of complexes III, IV and V 
inhibited FVG-R-directed Renilla luciferase expression significantly, 
whereas complex I and II inhibitors had little or no effect. In contrast, the 
inhibitors had no significant effects on cell viability and Gaussia luciferase 
expression of a murine leukaemia virus derivative (MLV-GL) that, like FVG-R, 
depended on VSV G-mediated entry. All experiments were conducted 
three times in duplicate, with the results reported as mean ± s.d. 3-NPA, 
3-nitropropionic acid. 

Figure 3 | Effect of siRNAs against selected genes on the replication of 
influenza viruses, VSV or vaccinia virus in human HEK 293 cells. a, b, The 
titres of influenza viruses (WSN and Indonesia 7), VSV and vaccinia virus in 
HEK 293 cells treated with siRNA against ATP6V0D1 and COX6A1 (a) or 
against NXF1 (b) are shown. Experimental details are given in Methods and 
Supplementary Methods. All experiments were conducted three times, with 
the results reported as mean ± s.d. 


export from the nucleus requires caspase 3 activation”, which can be 
induced by cytochrome c release from mitochondria. Moreover, 
human BCL2 inhibits both cytochrome c release and influenza 
vRNP export”, and COX function and changes in mitochondrial 
membrane potential have been linked to caspase activation™*. These 
pathways may underlie our further findings (Fig. 2c) that influenza 
virus replication and expression in mammalian cells are strongly and 
selectively modulated by compounds that inhibit COX, cytochrome 
c-linked electron transport chain complex III, and normal ion trans- 
port across mitochondrial membranes (oligomycin and valinomy- 
cin). Potential roles of the other two host genes confirmed here in 
mammalian cells, ATP6V0D1 and NXF1, in influenza virus replication 
are discussed further in the Supplementary Information. 

The above results with multiple, diverse genes, including 
ATP6V0D1, NXF1 and COX6A1 as well as mitochondrial electron 
transport complexes III and V, demonstrate the feasibility and value 
of using Drosophila RNAi screening to identify previously unknown 
host factors with important and potentially unsuspected roles in 
influenza virus replication. Simultaneously, the genome-wide results 
from our Drosophila RNAi screen provide more than 100 additional 
candidate genes (many with unknown functions) to be tested in 
mammalian cells. We suggest that the same strategy could be applied 
to identify previously unknown host factors involved in the replica- 
tion of other viruses, whenever at least a portion of their replication 
cycle is supported by Drosophila cells. 


METHODS SUMMARY 

Cells and viruses. MDCK cells, HEK 293 cells, baby hamster kidney (BHK) cells, 
D-Mel2 cells and DL1 cells were maintained as described in Methods. WSN, 
FVG-G, FVG-R and Indonesia 7 viruses were generated as described previously’” 
and propagated in MDCK cells. VSV and vaccinia virus were grown in BHK and 
CV-1 cells, respectively. Gaussia-luciferase-expressing murine leukaemia virus 
(MLV-GL) and the GFP-expressing adenovirus were provided by J. Bruce and R. 
Kalejta”’, respectively. 

Drosophila RNAi library analysis. Double-stranded RNA of the Drosophila 
RNAi library (targeting 13,071 Drosophila genes) and DL1 cells were added to 
each well of 384-well microplates. After two days of incubation, cells were 
infected with the FVG-R virus. At one day post-infection, Renilla luciferase 
activity was measured as described below. Two independent analyses of the 
entire library were performed. 

siRNA treatment of mammalian cells. HEK 293 cells were transfected with 
siRNA by TransIT-TKO (Mirus). Cells were incubated for 12 h in experiments 
with NXF1 or for 24 h with other genes, were transfected again under the same 
conditions, and were inoculated with virus after 12 h for NXF1 or after 24 h for 
other genes. After a further 12 h (for NXF1) or 48 h (other genes), the cells were 
harvested for the indicated analyses. 
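
The two siRNA schedules described above can be tallied as running totals. The following is a minimal sketch for illustration only: the function name and step labels are hypothetical and are not part of the published protocol. It simply adds the stated intervals, giving 36 h in total for NXF1 and 96 h for the other genes.

```python
# Illustrative sketch: cumulative timing of the two-round siRNA protocol.
# The function and step labels are hypothetical names chosen for this example.

def timeline_hours(first_incubation, second_incubation, infection):
    """Return (step, cumulative hours) pairs for one siRNA schedule."""
    phases = [
        ("second transfection", first_incubation),
        ("virus inoculation", second_incubation),
        ("harvest", infection),
    ]
    elapsed = 0
    schedule = []
    for step, duration in phases:
        elapsed += duration
        schedule.append((step, elapsed))
    return schedule


# Intervals (in hours) as stated in the text.
NXF1_SCHEDULE = timeline_hours(12, 12, 12)
OTHER_SCHEDULE = timeline_hours(24, 24, 48)

if __name__ == "__main__":
    for name, schedule in (("NXF1", NXF1_SCHEDULE), ("other genes", OTHER_SCHEDULE)):
        for step, hours in schedule:
            print(f"{name}: {step} at {hours} h after the first transfection")
```

The shortened NXF1 schedule corresponds to the accelerated 36-h time line mentioned in the main text.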

Renilla luciferase and cell viability assays. Renilla luciferase activity and cell 
viability were measured with established Renilla luciferase and CellTiter-Glo 
assay systems (Promega) according to the manufacturer’s instructions; signals 
were read with a GLOMAX 96 microplate luminometer. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Cells and viruses. MDCK cells, HEK 293 cells and BHK cells were maintained in 
minimum essential medium containing 5% fetal calf serum and antibiotics at 
37 °C in 5% CO2. D-Mel2 cells were maintained in Drosophila-SFM (GIBCO/ 
Invitrogen) at 28 °C. DL1 cells were maintained in Schneider’s Drosophila (SD) 
medium containing 10% FBS at 28 °C. 

WSN, FVG-G, FVG-R and Indonesia 7 viruses were generated by a plasmid-based 
reverse genetics system’? and were grown and titrated in MDCK cells. 
MLV-GL was produced by replacing the CD4 open reading frame in 
pCMMP-CD4-eGFP”* with that of Gaussia luciferase (J. Bruce, unpublished 
observations). All experiments with Indonesia 7 virus were conducted in a bio- 
safety level 3 containment laboratory approved for such use by the Centers for 
Disease Control and Prevention and the US Department of Agriculture. 
Drosophila RNAi library analysis. The Drosophila RNAi library (Ambion) con- 
tained 13,071 individual dsRNAs, each designed to specifically target a single 
Drosophila gene. Five microlitres of dsRNA (40 ng µl⁻¹) were added to each well 
of 384-well plates, after which 2 × 10⁴ DL1 cells in 10 µl of SD medium were 
added to each well and incubated with the dsRNA at 28 °C for 60 min. 
Twenty microlitres of SD medium containing 20% FBS was then added to each 
well after incubation. Cells were treated with dsRNA for two days at 28 °C and 
were then inoculated with an amount of FVG-R virus corresponding to a mul- 
tiplicity of infection of 10 for MDCK cells, and were then transferred to 33 °C. At 
one day post-infection, Renilla luciferase activity was measured as described 
below. Two independent analyses of the entire library were performed. 
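
The per-well quantities implied by this protocol follow from simple arithmetic. The sketch below is illustrative only and rests on stated assumptions: volumes are read as microlitres, the dsRNA solution and the initial SD medium are assumed to contain no serum, and the plate count assumes one dsRNA per well with no wells reserved for controls. None of these assumptions is confirmed by the text.

```python
import math

# Values taken from the protocol description (volumes read as microlitres).
DSRNA_VOLUME_UL = 5          # dsRNA added per well
DSRNA_CONC_NG_PER_UL = 40    # dsRNA concentration
CELL_VOLUME_UL = 10          # SD medium carrying the DL1 cells (assumed serum-free)
FBS_MEDIUM_VOLUME_UL = 20    # SD medium added after the 60-min incubation
FBS_MEDIUM_PERCENT = 20      # FBS content of that medium
LIBRARY_SIZE = 13_071        # individual dsRNAs in the library
WELLS_PER_PLATE = 384

# Mass of dsRNA delivered to each well.
dsrna_ng_per_well = DSRNA_VOLUME_UL * DSRNA_CONC_NG_PER_UL

# Final culture volume and serum concentration after all additions.
total_volume_ul = DSRNA_VOLUME_UL + CELL_VOLUME_UL + FBS_MEDIUM_VOLUME_UL
final_fbs_percent = FBS_MEDIUM_VOLUME_UL * FBS_MEDIUM_PERCENT / total_volume_ul

# Lower bound on plates per library pass (assumes no control wells).
min_plates = math.ceil(LIBRARY_SIZE / WELLS_PER_PLATE)

if __name__ == "__main__":
    print(f"dsRNA per well: {dsrna_ng_per_well} ng")
    print(f"final volume: {total_volume_ul} µl, final FBS: {final_fbs_percent:.1f}%")
    print(f"minimum plates per library pass: {min_plates}")
```

Under these assumptions the final serum concentration is close to the 10% FBS used for routine maintenance of DL1 cells.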
siRNA treatment of mammalian cells. The non-targeted siRNA used was 
siCONTROL 1 (Dharmacon). Duplex siRNAs against candidate genes were 
obtained from siGENOME (Dharmacon, see Supplementary Table 7 for 
sequences); the effects of siRNA were evaluated by reverse transcription followed 
by PCR (RT-PCR; Supplementary Fig. 7). The sequences of siRNA against the 
NP gene of influenza virus (GGAUCUUAUUUCCUUCGGAGUU)” and the 
E3L gene of vaccinia virus (AAUAUCGUCGGAGCUGUACAC)”* were reported 
previously. The sequence of siRNA against the gene encoding the L protein of 
VSV (CGAGUUAUCCAGCAAUCAUUU) was designed using BLOCK-iT RNAi 
designer (Invitrogen). A HEK 293 cell suspension was seeded into the wells of a 
24-well plate (2.0 X 10° cells per well), incubated for 1h and transfected with 
siRNA (to a final concentration of 10 nM; Dharmacon) using TransIT-TKO 
(Mirus), according to the manufacturer’s instructions. 
Inhibitor treatment of mammalian cells. Mitochondrial electron transport 
chain inhibitors were used at the following final concentrations: rotenone 
(0.511M), 3-nitropropionic acid (1 mM), antimycin A (25M), NaN3 
(50mM), oligomycin (25 UM) and valinomycin (0.9 14M). HEK 293 cells were 
treated with the inhibitors for 5 h before being infected with FVG-R or MLV-GL, 
incubated for 18 h, and assayed for Renilla or Gaussia luciferase, respectively. Cell 
viability was assayed in parallel by a firefly-luciferase-mediated ATP assay 
(CellTiter-Glo, Promega). 


26. Bruce, J. W., Bradley, K. A., Ahlquist, P. & Young, J. A. Isolation of cell lines that 
show novel, murine leukemia virus-specific blocks to early steps of retroviral 
replication. J. Virol. 79, 12969-12978 (2005). 

27. Ge, Q. et al. RNA interference of influenza virus production by directly targeting 
mRNA for degradation and indirectly inhibiting all viral RNA transcription. Proc. 
Natl Acad. Sci. USA 100, 2718-2723 (2003). 

28. Dave, R. S. et al. siRNA targeting vaccinia virus double-stranded RNA binding 
protein [E3L] exerts potent antiviral effects. Virology 348, 489-497 (2006). 


©2008 Macmillan Publishers Limited. All rights reserved 


nature 


LETTERS 


Vol 454|14 August 2008|doi:10.1038/nature07199 


Transient FTY720 treatment promotes immune- 
mediated clearance of a chronic viral infection 


Mary Premenko-Lanier’, Nelson B. Moseley’, Sarah T. Pruett*, Pablo A. Romagnoli’ & John D. Altman’ 


For a wide variety of microbial pathogens, the outcome of the 
infection is indeterminate. In some individuals the microbe is 
cleared, but in others it establishes a chronic infection, and the 
factors that tip this balance are often unknown. In a widely used 
model of chronic viral infection, C57BL/6 mice clear the 
Armstrong strain of lymphocytic choriomeningitis virus 
(LCMV), but the clone 13 strain persists’*. Here we show that 
the Armstrong strain induces a profound lymphopenia at days 
1-3 after infection, but the clone 13 strain does not. If we transi- 
ently augment lymphopenia by treating the clone-13-infected 
mice with the drug FTY720 at days 0-2 after infection, the mice 
successfully clear the infection by day 30. Clearance does not occur 
when CD4 T cells are absent at the time of treatment, indicating 
that the drug is not exerting direct antiviral effects. Notably, 
FTY720 treatment of an already established persistent infection 
also leads to viral clearance. In both models, FTY720 treatment 
preserves or augments LCMV-specific CD4 and CD8 T-cell res- 
ponses, a result that is counter-intuitive because FTY720 is gen- 
erally regarded as a new immunosuppressive agent’. Because 
FTY720 targets host pathways that are completely evolutionarily 
conserved, our results may be translatable into new immunothera- 
pies for the treatment of chronic microbial infections in humans. 

High-level persistent infections cause a variety of immune system 
dysfunctions*”, and the initial focus of our study was to assess their 
effects on bystander naive T cells. For these experiments, we used the 
established LCMV model and infected mice with either the 
Armstrong or the clone 13 strain of LCMV. In the course of these 
experiments we observed that Armstrong infection, as expected’, 
induced an early and profound transient decrease in the total number 
of lymphocytes circulating in the blood (lymphopenia); notably, 
clone 13 infection induced lymphopenia that was much less pro- 
found (Fig. 1a and Supplementary Fig. 1a, b). Sequestration contrib- 
uted significantly to the Armstrong-induced lymphopenia because a 
sentinel population of adoptively transferred F5 T-cell-receptor 
transgenic cells, specific for an influenza epitope not found in 
LCMV”, disappeared from the blood 1-3 days after infection but 
re-appeared between days 3 to 4 after infection (Supplementary 
Fig. 2a). In contrast, the total number of F5 cells in the blood and 
lymphoid organs remained constant, demonstrating that the blood 
lymphopenia was not due to global lymphocyte death in the 
Armstrong-infected mice (Supplementary Fig. 2b). During the per- 
iod of lymphopenia, lymphocytes were quantified in secondary 
lymphoid tissues (Supplementary Fig. 3). Lymphocytes were 
observed to accumulate in the lymph nodes after both Armstrong 
and clone 13 infections, but the quantities were greater in the 
Armstrong-infected mice. In general, the total number of CD4 and 
CD8 T cells, and CD19 B cells, were decreased in the spleens of mice 
infected with both virus strains. 


Sequestration of cells in lymph nodes is regulated at the level of both 
entry and exit, and the exit pathway is regulated by a G-protein-coupled 
receptor for sphingosine-1-phosphate (S1P), called S1P1 
(ref. 10). Although high-quality reagents for directly measuring S1P1 
levels are not readily available, CD69 has recently been reported to 
downregulate S1P1, and upregulation of CD69 is induced by type I 
interferon (IFN-I)*. Seeking an explanation for the reduced lympho- 
penia after clone 13 infection, we performed daily measurements of 
the amounts of IFN-I and S1P in the serum of clone-13- and 
Armstrong-infected mice and uninfected controls; at the same time 
points, we also measured CD69 expression on lymphocytes. 
Compared to clone-13-infected mice, mice infected with the 
Armstrong strain had slightly higher serum levels of IFN-I that were 
significant on day 1 after infection (Fig. 1b, P = 0.0001), slightly lower 
amounts of serum S1P (Fig. 1c, P = 0.0473, day 1), and slightly higher 
amounts of S1P in their lymph nodes (Fig. 1d, P = 0.0271, day 1, and 
P = 0.0012, day 3). However, despite the differences in IFN-I levels, 
there was no difference in the frequency of cells that carry the CD69 
antigen in the blood, nor a difference in the levels of CD69 expression 
on CD8 T cells (Fig. 1e), CD4, or CD19 cells (data not shown). 
Therefore, we still lack an understanding of the causes of the difference 
in lymphopenia after infection with the two strains of LCMV. 

The difference in lymphopenia after infection with Armstrong and 
clone 13 strains raises the question as to whether the immune system 
would be able to clear a clone 13 infection if we could artificially induce 
a more profound but still transient lymphopenia. To force the sequestration 
of lymphocytes we chose the immunomodulatory pro-drug 
FTY720, well known for its ability to sequester lymphocytes in lymph 
nodes and Peyer’s patches'’. When phosphorylated, FTY720-P is an 
S1P analogue capable of binding to four of the five S1P receptors. 
FTY720 has been tested in human clinical trials for the treatment of 
relapsing multiple sclerosis'* and the prevention of kidney transplant 
rejection’’, and it is widely regarded as a new class of immunosuppres- 
sive agent. In contrast to traditional immunosuppressive drugs, 
FTY720 is thought to work by manipulating lymphocyte location 
and has been shown to have no direct effect on the activation of T cells’. 

To force the early sequestration of naive T cells during clone 13 
infection, FTY720 was administered to mice intravenously at doses 
ranging from 0.8 µg kg⁻¹ to 1 mg kg⁻¹. In an attempt to mimic the 
transient lymphopenia seen after infection with the Armstrong 
strain, we chose a treatment regimen in which FTY720 was adminis- 
tered once a day for 3 days beginning at the time of infection. At low 
FTY720 doses, T cells were sequestered for 3 days after infection, but 
returned to the circulation by day 4; at higher drug doses, they 
remained sequestered for at least an extra day (Supplementary Fig. 
4a, b). Furthermore, low doses of FTY720 caused a significant reduc- 
tion in clone 13 serum titres at 8 days after infection, but the highest 
dose (1 mg kg⁻¹) did not (Supplementary Fig. 4c). On the basis of 

Figure 1 | LCMV Armstrong but not clone 13 induces profound 
lymphopenia at days 2 and 3 after infection. Wild-type mice were infected 
intravenously (i.v.) at 2 X 10° p.f.u. with LCMV Armstrong strain (filled 
squares; Arm), clone 13 strain (filled triangles; Cl-13), or were uninfected 
(filled diamonds) and measurements were taken on successive days after 
infection. a, The concentration of lymphocytes (CD8 T cells) per ml of blood 


these results, we chose to use a dose of 4 µg kg⁻¹ for all subsequent 
experiments. Although this dose causes lymphopenia in clone-13- 
infected mice, it does not in uninfected controls (data not shown), 
suggesting that there is some synergy between the drug and the 
infected state. Also at this dose, FTY720 does not alter the concentra- 
tions of S1P in the serum or lymph nodes, the IFN-I amount in serum 
or the expression of CD69 on lymphocytes (data not shown). FTY720 
causes a transient and statistically significant increase in the numbers 
of T and B cells in the mesenteric lymph nodes in clone-13-infected 
mice (P = 0.0267); drug-associated increases in the numbers of lym- 
phocytes are also seen in the peripheral lymph nodes, but these are 
not statistically significant (Supplementary Fig. 5a). In experiments 
using adoptively transferred LCMV-specific P14 T-cell-receptor 
transgenic cells, FTY720 treatment induced a significant increase in 
the numbers of P14 cells in the lymph nodes at days 3 and 4 after 
infection, consistent with the hypothesis that sequestration accelerates 
priming of the adaptive CD8+ T-cell response (Fig. 2a). 

The data in Supplementary Fig. 4c prompted us to test more 
extensively the efficacy of FTY720 treatment of clone-13-infected 
mice. We focused on two measures of efficacy: viral titres in the 
serum, brain and kidneys, and T-cell function. Beginning as soon 
as 5 days after infection, FTY720-treated mice had significantly lower 
clone 13 titres in the serum than untreated mice (P = 0.0066; Fig. 2b), 
and the drug-induced control of the virus continued until it was 
cleared by day 29 after infection. Viral titres in the kidneys and brain 
of the clone-13-infected, FTY720-treated mice are undetectable at 
day 8 and beyond, in contrast to the untreated clone-13-infected 
mice that had high viral titres in the kidney and brain up to at least 
day 60 after infection (data not shown). 

T-cell exhaustion is a hallmark of chronic clone 13 infection (refs 7, 15, 16). 
In mice infected with clone 13, cells specific for the dominant epitope 
NP396 are deleted and the T cells specific for the co-dominant GP33 
epitope become dysfunctional’. We found that mice treated with 
FTY720 at the time of clone 13 infection have functional CD8 T cells 
specific for the LCMV epitopes GP33 and NP396 throughout the 

is shown. b, IFN-I amounts in the serum; IFN-I was measured using a 
standard vesicular stomatitis virus protection assay using the L929 fibroblast 
cell line**. c, d, S1P concentrations in serum (c) and in lymph nodes (LN, 
d); S1P concentrations were measured by liquid chromatography tandem 
mass spectrometry; n = 3 per group”. e, CD69 expression on CD8 T cells in 
the blood. All data are representative of 2-3 experiments. Error bars are s.d. 


course of the study (Fig. 2c, d). We also analysed the LCMV-specific 
CD4 T-cell response and found that FTY720-treated but not 
untreated mice retained GP61-specific functional T cells (Fig. 2e). 
Furthermore, PD-1 expression on CD8 T cells is tightly linked to the 
presence of antigen (and T-cell exhaustion in the chronic phase of 
infection)’, and by day 15 after clone 13 infection, PD-1 amounts are 
lower on LCMV-specific cells in FTY720-treated but not untreated 
mice (Supplementary Fig. 6); this result correlates with the control of 
the infection. Neutralizing antibodies were measured on day 60 after 
infection using a simple plaque reduction assay, and FTY720 treat- 
ment did not alter their titres (data not shown). Finally, mice given 
FTY720 at the time of infection form a stable memory population, 
with fully functional CD8 T cells that are able to produce interleukin 
(IL)-2 on in vitro stimulation with the dominant epitopes NP396 and 
GP33 (Fig. 2f). Therefore, FTY720 treatment at the time of clone 13 
infection leads to clearance of the virus and preservation of LCMV- 
specific T-cell responses. 

We next investigated whether short-term FTY720 treatment could 
clear an already established clone 13 infection. Mice were infected 
with clone 13 and treated with FTY720 at 4 µg kg⁻¹ on days 30, 31 
and 32 after infection. By day 30 after treatment—60 days after infec- 
tion—the virus was cleared from the kidneys and brain (Fig. 3a; 
P = 0.0008, unpaired t-test), organs that harbour virus for the lifetime 
of the untreated mouse. Within 2 weeks of FTY720 treatment, NP396 
CD8 T-cell specific peptide responses and GP61 CD4 T-cell specific 
responses were detectable in the treated mice but not in the untreated 
controls (Fig. 3b); GP33-specific responses were detected in 
untreated mice but were significantly enhanced after treatment, as 
detected by both cytokine production (Fig. 3b) and tetramer staining 
(Fig. 3c). At 4 weeks after treatment, GP33-tetramer-positive cells 
in lymph nodes had downregulated expression of PD-1 and 
upregulated expression of CD62L, indicating a transition from 
effector to memory T cells (Fig. 3d). 

The expansion of GP33-specific cells after FTY720 treatment 
during the persistent phase of infection was also seen in mice 

Figure 2 | Low-dose short-term FTY720 treatment of clone-13-infected 
mice prevents the establishment of a persistent LCMV infection. Wild- 
type mice were infected intravenously with 2 x 10° p.f.u. of LCMV 
Armstrong (Arm) or clone 13 (Cl-13). FTY720 was administered 
intravenously at 4 µg kg⁻¹ on days 0, 1 and 2 after infection. For a only, 

1 X 10° P14 cells were transferred 48 h before infection. a, Total P14 cells in 
lymph nodes (LN) on days 3 (left) and 4 (right) after infection. b, Viral titres 
in serum were determined using a standard plaque assay from mice treated 
with Armstrong (squares), clone 13 (triangles) and clone 13 plus FTY720 
(circles). Unpaired t-tests comparing clone-13-infected mice with or without 
FTY720 treatment were as follows: P = 0.0066 day 5, P = 0.0001 day 8, 


thymectomized before infection (Supplementary Fig. 7). In the thymectomized 
mice, the kinetics of expansion of LCMV NP396-specific cells were slower 
than those of cells specific for GP33. These results are 
consistent with the interpretation that at least a fraction of the func- 
tional cell populations seen after FTY720 treatment derives from 
previously dysfunctional LCMV-specific cells. 

Because CD4 T cells are important for the maintenance of CD8 
T-cell responses, we tested whether FTY720 can enhance immunity 
in mice depleted of CD4 T cells at the time of infection. CD4 T-cell 
depletion ablated the immune enhancing effects of FTY720 treatment 
and there was no significant reduction of viral titres (Fig. 3e), nor 
were CD8 T-cell responses enhanced when CD4 T cells were not 
present (Fig. 3f). However, CD4 T cells return to the circulation 
between days 10 to 15 after antibody-mediated depletion, and on 
their return, treatment with FTY720 (beginning on day 15 or later) 
restored LCMV-specific T-cell responses (Fig. 3f). CD4-depleted but 
not untreated control mice showed a progressive loss of cells producing 
IFN-γ in response to stimulation with GP33 and NP396 
P = 0.0256 day 15, P = 0.0010 day 22 and P = 0.0058 day 30. c-e, T-cell 
responses to LCMV epitopes were analysed by intracellular cytokine staining 
in mice infected with Armstrong (Arm), clone 13 (Cl-13) and FTY720- 
treated clone 13 (Cl-13 FTY). CD8 responses to GP33 (c), to NP396 (d), and 
CD4 responses to GP61 (e) (day 30 only) are shown. f, GP33 and NP396 CD8 
T-cell responses were analysed by intracellular cytokine staining for 
production of IL-2 and IFN-γ at 60 days after infection. For all panels, mice 
were used at 3-5 per group; data are representative of 2-4 experiments. The 
numbers on the fluorescence-activated cell sorting (FACS) plot indicate the 
percentage of total CD8 or CD4 T cells in each quadrant. Error bars are s.d. 


peptides (data not shown), in agreement with previously published 
data’. These data show that CD4 T cells are necessary for the immune 
enhancing effects of FTY720 and demonstrate that FTY720-induced 
clearance of clone 13 is mediated by the immune response and is not 
directly due to any antiviral effects of the drug. 

Several other approaches have been used to treat chronic LCMV 
infections, including using anti-PD-L1 (ref. 17), anti-IL-10R (ref. 18) 
or anti-CD70 (ref. 19) antibodies, and therapeutic vaccination com- 
bined with antibody blockade**”', but none was as successful at clear- 
ing an established chronic infection as the transient use of FTY720. 
Although we were motivated to test FTY720 on the basis of our 
observations of reduced lymphopenia during clone 13 infection, 
we do not yet know the exact mechanism by which it is promoting 
clearance of chronic LCMV. On the basis of the known targets of 
FTY720 (ref. 10), we have no reason to suspect it is mechanistically 
connected to any of the previously described therapies. In this study 
we have shown that treatment with FTY720 leads to the restoration of 
functional antigen-specific CD8 and CD4 T cells. The restoration of 

Figure 3 | Low-dose short-term FTY720 treatment clears an established 
persistent clone 13 infection. Wild-type mice were infected with LCMV 
clone 13 at 2 X 10° p.f.u. intravenously and treated with vehicle or FTY720 at 
4 µg kg⁻¹ intravenously on days 30, 31 and 32 after infection. a, Viral titres in 
the kidneys and brain were measured using a standard plaque assay at day 30 
after FTY720 treatment (60 days after infection; P = 0.0008, unpaired t-test). 
b, IFN-γ, IL-2 and TNF-α production by LCMV-specific T cells in clone-13-infected 
(Cl-13) and FTY720-treated clone-13-infected (Cl-13 FTY) mice 
were analysed by intracellular cytokine staining. c, Total GP33-tetramer- 
positive CD8 T cells in the spleen 4 weeks after FTY720 treatment. d, PD-1 
and CD62L expression on GP33-tetramer-positive cells in the lymph nodes 
at day 30 after FTY720 treatment (day 60 after infection). e, Serum viral 
titres in clone-13-infected mice that were either treated with FTY720 (filled 


CD8 T cells is dependent on CD4 T cells and in thymectomized mice 
there is a delay in restoration. Therefore, it is possible that a de novo 
CD4 T-cell response is necessary to restore dysfunctional CD8 T cells. 
In addition, this treatment was not effective in B-cell knockout mice 
(data not shown), although it is not clear whether this is due to the 
lack of B cells or the lack of normal immune architecture”. 

It remains unclear as to how a drug well known for its immuno- 
suppressive effects is also able to induce immune-mediated clearance 
of a chronic viral infection. In the setting of multiple sclerosis and 
prevention of transplant rejection, FTY720 is thought to act by keep- 
ing cells sequestered in the lymph nodes, away from the peripheral 
sites of pathology. In contrast, during an infection lymph nodes are a 
site of immune induction, and in the context of an LCMV infection, 
lymph nodes are also a significant site of virus production and patho- 
logy. Because our current data on FTY720-induced clearance of a 
virus are limited to the single example of the LCMV model, it remains 
to be determined whether the effect translates to the successful treat- 
ment of other chronic infections, including those that afflict humans. 


METHODS SUMMARY 

Mice and infections. C57BL/6, C57BL/6-thymectomized mice and B6.PL-Thy1a/CyJ 
mice were purchased from the Jackson Laboratory; F5 mice were 
provided by D. Moskophidis; P14 mice were a gift from the Ahmed laboratory. 
circles), or CD4 depleted and FTY720 treated (open circles) at the time of 
infection. CD4 depletion was performed by administration of 500 µg of 
GK1.5 monoclonal antibody at days 0 and 1; FTY720 was administered 
intravenously at 4 µg kg⁻¹ on days 0, 1 and 2 after infection. f, GP33 and 
NP396 T-cell responses, measured by IFN-γ and TNF-α production, were 
analysed in the spleens of clone-13-infected mice that were depleted of CD4 
T cells at the time of infection and treated with FTY720 at various time 
points after infection as indicated; T-cell responses were analysed 7 and 14 
days after FTY720 treatment. GP33 and NP396 CD8 T cells from LCMV- 
infected mice were stimulated with peptide and analysed for cytokine 
production 1 and 2 weeks after FTY720 treatment. All data are representative 
of 2-3 experiments; n = 3-5 per group. Error bars are s.d. 


LCMV Armstrong and clone 13 strains were provided by the Ahmed laboratory 
and administered intravenously at 2 X 10° plaque-forming units (p.f.u.). Where 
indicated, CD4 T cells were depleted as described'*. 

Administration of FTY720. FTY720 (Cayman Chemical) was reconstituted in 
DMSO, diluted with water and administered intravenously at various 
doses for 3 days in a total volume of 200 µl. Control animals received 
mock-treated water. 

Cell enumeration and staining. Cells were counted using a standard haemo- 
cytometer. Flow cytometric analysis was performed on a LSRII (Becton 
Dickinson). All monoclonal antibodies used were from Becton Dickinson, 
CalTag or eBioscience. Tetramers were produced as described”. 

Type I interferon measurement. A type I interferon bioassay was used to measure 
IFN-I as described’. IFN-α (Biomedical Laboratories PBL) was used as a control. 
S1P measurement: mass spectrometry. Serum and lymph nodes were collected 
from three mice and were pooled. Lymph nodes were weighed and homogenized. 
Sphingolipid extraction was performed using methanol and chloroform followed 
by potassium hydroxide. C17-sphingosine-1-phosphate (Avanti Polar Lipids) was 
used as an internal control. After extraction, sphingoid base 1-phosphate was quan- 
tified using liquid chromatography tandem mass spectrometry on reverse phase 
(C18) chromatography with a Perkin Elmer Series 200 HPLC system that interfaces 
with an API 2000 (Applied Biosystems) triple quadrupole mass spectrometer”. 
Intracellular cytokine assay. Intracellular cytokine staining was performed as 
described’. 

Statistics. All probabilities were determined with a two-tailed Student’s t-test, 
calculated using InStat. 
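
For readers who wish to see what such a comparison involves, the following is a minimal sketch of an unpaired, two-tailed Student's t-test with pooled variance. It is not the InStat implementation, and the example values are invented log10 titres used purely for illustration. The p-value is obtained by numerically integrating the t density with Simpson's rule, an approach chosen here only to keep the example free of external libraries.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=10_000):
    """Two-tailed p-value from Simpson's rule on the density between 0 and |t|."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    area = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area *= h / 3
    return max(0.0, 1.0 - 2.0 * area)


def unpaired_t_test(a, b):
    """Return (t, degrees of freedom, two-tailed p) assuming equal variances."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df, two_tailed_p(t, df)


if __name__ == "__main__":
    # Hypothetical log10 serum titres; NOT data from the paper.
    untreated = [4.0, 5.0, 6.0]
    treated = [1.0, 2.0, 3.0]
    t, df, p = unpaired_t_test(untreated, treated)
    print(f"t = {t:.3f}, df = {df}, two-tailed P = {p:.4f}")
```

With group sizes of 3-5 mice, as used in this study, the degrees of freedom are small, so the t distribution rather than the normal distribution is the appropriate reference.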

Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Mice, virus, infections and treatments. C57BL/6 (Ly5.2) and thymectomized 
mice were purchased from the Jackson Laboratory. P14 mice were a gift from the 
Ahmed laboratory and were housed in our animal facility. All infections were 
performed intravenously at 210° p.f.u. of LCMV clone 13 or LCMV 
Armstrong. Where indicated, CD4 T cells were depleted by injecting 500 µg 
GK1.5 in PBS on the day of infection and 1 day after infection. Viral titres in 
the serum and tissue homogenates were performed on Vero cells as described’. 
FTY720 (Cayman Chemical) was dissolved at 20 mg ml⁻¹ in DMSO. FTY720 was 
diluted in water and administered in a total volume of 200 µl, in a range of 
800 ng kg⁻¹ to 1 mg kg⁻¹, intravenously. 
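
The dilution needed to deliver a given dose in the stated injection volume can be worked out as follows. This is a sketch under stated assumptions: the 20 g body mass is a hypothetical value chosen for illustration (the text does not give mouse weights), and the function name is invented for this example.

```python
def injection_plan(dose_ug_per_kg, body_mass_g, stock_mg_per_ml=20.0, volume_ul=200.0):
    """Return (dose in µg, working concentration in µg/ml, fold dilution of stock).

    stock_mg_per_ml and volume_ul default to the values given in the Methods;
    body_mass_g must be supplied by the user and is an assumption here.
    """
    dose_ug = dose_ug_per_kg * body_mass_g / 1000.0          # µg per animal
    working_ug_per_ml = dose_ug / (volume_ul / 1000.0)       # µg per ml injected
    fold_dilution = stock_mg_per_ml * 1000.0 / working_ug_per_ml
    return dose_ug, working_ug_per_ml, fold_dilution


if __name__ == "__main__":
    # 4 µg/kg is the dose used for most experiments; 20 g is an assumed body mass.
    dose, conc, fold = injection_plan(dose_ug_per_kg=4.0, body_mass_g=20.0)
    print(f"dose per mouse: {dose * 1000:.0f} ng")
    print(f"working concentration: {conc:.2f} µg/ml")
    print(f"dilution from stock: about {fold:,.0f}-fold")
```

For the assumed 20 g mouse, the 4 µg kg⁻¹ dose corresponds to roughly 80 ng of drug per injection, so the DMSO stock is diluted several tens of thousands of times in water.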

Tissue preparation and cell enumeration. Blood was collected either with or 
without heparin tubes and spun down to collect serum. Serum was used to 
measure type I IFNs, S1P levels and viral titres. Peripheral blood mononuclear 
cells were separated from heparin blood using Lympholyte (Cedarlane 
Laboratories Limited, CL5115). Tissues were collected and lymphocytes were 
separated by force; when indicated collagenase type 4 (Worthington 46E8824) 
was used at 1 mg ml⁻¹. Cells were counted visually using a haemocytometer. 
Flow cytometry and in vitro stimulations. Anti-CD8, anti-PD-1, anti-CD69, 
anti-IFN-y, anti-TNF-o and IL-2 were purchased from BD Biosciences. Anti- 
CD4, anti-CD19 and anti-thy antibodies were purchased from eBioscience. 
MHC class I tetramers were manufactured as described™*. To quantify LCMV 
specific CD8 T cells, DbNP396-404, DbGP33—41 and DbGP276—286 tetramers 
were used. Intracellular cytokine staining was performed according to the man- 
ufacturer’s protocol following a 6h stimulation of cells with 0.2 1g ml! of the 
indicated peptide. 

Type I IFN bioassay. IFN-I bioactivity was measured using the L929 bioassay.
L929 cells (3 × 10⁴ L929 cells per 100 µl) in MEM with 5% FCS were added to a
96-well plate and incubated overnight at 37 °C in successive twofold dilutions of
serum samples. The murine IFN standard (Biomedical Laboratories PBL, 12100-1)
was used at a final concentration of 100 U ml⁻¹. Twenty-four hours later,
vesicular stomatitis virus was added to the wells at 50 p.f.u. per well and the cells
were incubated for 3 days. The dilution mediating 50% protection was defined as
1 U of IFN-I per ml.
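
The endpoint readout described above can be illustrated with a minimal sketch. It assumes that protection has already been scored per well as a fraction between 0 and 1, that the first well is a 1:2 dilution, and that the titre is reported as the reciprocal of the highest dilution still giving at least 50% protection. The function name, the starting dilution and the example values are hypothetical and are not taken from the paper.

```python
def ifn_titre(protection, threshold=0.5, first_dilution=2):
    """Reciprocal of the highest dilution whose protection meets the threshold.

    protection: fractions of the cell monolayer protected (0.0 to 1.0), ordered
    from the least dilute well through successive twofold dilutions.
    Returns 0 if even the first well falls below the threshold.
    """
    titre = 0
    reciprocal = first_dilution
    for fraction in protection:
        if fraction < threshold:
            break
        titre = reciprocal
        reciprocal *= 2  # successive twofold dilutions
    return titre


# Hypothetical plate readout for one serum sample (1:2, 1:4, 1:8, ...).
sample = [1.0, 0.95, 0.80, 0.55, 0.20, 0.05]
print(ifn_titre(sample), "U per ml")
```

In practice the result would be calibrated against the wells containing the murine IFN standard; that step is omitted here.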

S1P measurement: mass spectrometry. Serum and lymph nodes from three 
mice were collected and pooled. Lymph nodes were weighed and homogenized. 
Sphingolipid extraction was performed using methanol and chloroform fol- 
lowed by potassium hydroxide. C17-sphingosine-1-phosphate (Avanti Polar 
Lipids) served as an internal control. Following extraction, sphingoid base 
1-phosphate was quantified using liquid chromatography tandem mass spectro- 
metry on reverse phase (C18) chromatography with a Perkin Elmer Series 200 
HPLC system that interfaces with an API 2000 (Applied Biosystems) triple 
quadrupole mass spectrometer.
A discontinuous hammerhead ribozyme embedded in
a mammalian messenger RNA 


Monika Martick, Lucas H. Horan, Harry F. Noller & William G. Scott


Structured RNAs embedded in the untranslated regions (UTRs) of 
messenger RNAs can regulate gene expression. In bacteria, control 
of a metabolite gene is mediated by the self-cleaving activity of a 
ribozyme embedded in its 5’ UTR. This discovery has raised the
question of whether gene-regulating ribozymes also exist in eukaryotic
mRNAs. Here we show that highly active hammerhead ribozymes
are present in the 3’ UTRs of rodent C-type lectin type II
(Clec2) genes. Using a hammerhead RNA motif search with
relaxed delimitation of the non-conserved regions, we detected 
ribozyme sequences in which the invariant regions, in contrast 
to the previously identified continuous hammerheads, occur
as two fragments separated by hundreds of nucleotides. Notably, 
a fragment pair can assemble to form an active hammerhead ribo- 
zyme structure between the translation termination and the poly- 
adenylation signals within the 3’ UTR. We demonstrate that this 
hammerhead structure can self-cleave both in vitro and in vivo, 
and is able to reduce protein expression in mouse cells. These 
results indicate that an unrecognized mechanism of post-tran- 
scriptional gene regulation involving association of discontinuous 
ribozyme sequences within an mRNA may be modulating the 
expression of several CLEC2 proteins that function in bone 
remodelling and the immune response of several mammals. 

The hammerhead ribozyme is a small, self-cleaving motif com- 
posed of a three-helical junction with a core of invariant nucleotides 
required for activity. To identify hammerhead ribozymes in mam- 
malian mRNAs, we searched mRNA sequence databases using a pat- 
tern descriptor that allowed for insertions of up to 5,000 nucleotides 
at the ends of stem 1 or stem 3 (Supplementary Fig. 1). Three
hammerhead ribozymes were identified in the 3’ UTRs of known 
rodent mRNAs. Two are found embedded in the transcripts of mouse 
Clec2d (osteoclast inhibitory lectin, also known as Ocil, Clr-b and 
Clec2d8) and its paralogue Clec2e (also known as Clra and
Clec2d7), genes that belong to a group of phylogenetically related
sequences within the natural killer receptor gene complex of chro- 
mosome 6. The third ribozyme is found in rat CLEC2D11 (ref. 7)—a 
homologue of mouse Clec2d—which resides in the syntenic natural 
killer receptor gene complex region on chromosome 4. We extended 
our search to the genomic sequences of other organisms using the 
UCSC genome browser’s comparative genomics tool. Alignments
using the natural killer receptor gene complex regions of mouse and 
rat led to the identification of nine candidate hammerhead ribo- 
zymes: four in the 3’ regions of predicted rat, horse and platypus 
Clec2-like genes, and five in the unannotated regions of five other 
mammalian genomes (Supplementary Fig. 2). 

Unlike most known self-cleaving RNA motifs that are 
contiguous, the hammerhead ribozymes identified here
(referred to as CLEC2 ribozymes) are split into two fragments sepa- 
rated by a long ribozyme-unrelated insertion in the stem-1-capping 
loop. The insertion (which is 250 and 696 nucleotides in mouse 
Clec2d and Clec2e, respectively, and 145–1,739 nucleotides in other
candidate ribozyme sequences) segregates the upstream substrate 
region, residing within 7-44 nucleotides of the stop codon, from 
the downstream enzyme fragment (Fig. 1 and Supplementary Fig. 
2). When assembled from substrate and enzyme fragments, the sec- 
ondary structure reflects a characteristic hammerhead ribozyme core 
of fifteen conserved nucleotides flanked by three helices. Comparison 
of all twelve CLEC2 ribozymes showed conservation of stem 2 as well 
as the presence of compensatory mutations maintaining stem 1 and 
the atypically long stem 3. Stem 1 and stem 2 contain secondary 
structure elements required to form a tertiary contact known to 
enhance catalysis greatly (Fig. 1 and Supplementary Fig. 2). Six
loop and bulge residues (Fig. 1)—necessary for the active structure 
Figure 1 | Sequence arrangement and secondary structure model of the
rodent Clec2d hammerhead ribozymes. The mouse ribozyme sequence is 
shown in black and the rat ribozyme sequence length, single nucleotide, and 
base pair differences are denoted in red. The stop codon is shown in white. 
The substrate and enzyme sequences are shown on orange and blue 
backgrounds, respectively. The insertion sequence separating two ribozyme 
parts is abridged with a thick arrow. The predicted cleavage site (white 
arrowhead) is 3’ of the active site cytosine (circled). The three-helical 
junction, composed of conserved (with the exception of 2.1 and 1.1) 
nucleotides (nt) that make catalytically important interactions, is shown in 
greater detail (together with canonical numbering scheme) in the inset. The 
small black arrowheads indicate conserved nucleotides of the catalytically 
important loop/bulge interactions. 
stabilization and optimal catalysis in the Schistosoma mansoni hammerhead
ribozyme—are also found in most CLEC2 ribozyme
sequences. Taken together, the secondary structure features suggest 
that the 3’ UTR-embedded hammerhead ribozyme motifs form cat- 
alytically active structures despite their discontinuity. 

To test whether the hammerhead ribozymes embedded within the 
3' UTRs possess catalytic activity, we examined the in vitro transcrip- 
tion products of mouse 3’ UTRs from Clec2d (D-hcu, hammerhead 
containing UTR) and Clec2e (E-hcu). Cleavage of both 3’ UTR RNAs 
occurs during the course of the transcription reaction (data not 
shown) and at the predicted hammerhead ribozyme cleavage sites 
(Fig. 2). The DM1-hcu construct—which is expected to create an 
inactive hammerhead ribozyme due to a triple mutation that prevents 
the ‘substrate’ from being cleaved—yields an intact 3’ UTR transcript, 
thus confirming that the cleavage is due to the embedded hammer- 
head ribozyme activity. The mouse Clec2e-derived 3’ UTR cleaves to a 
lesser extent than D-hcu, perhaps because of misfolding effects due to 
a ~450 nucleotide longer intervening sequence and/or weakened sec- 
ondary structure in the stem 1 and stem 3 regions (Supplementary Fig. 
3). These results demonstrate that the discontinuous ‘substrate’ and 
‘enzyme’ fragments form a fully active hammerhead ribozyme. 

To determine whether the D-hcu sequence can reduce gene 
expression by forming an active ribozyme in vivo, it was incorporated 
into a dual-luciferase gene expression vector and assayed for activity 
in NIH 3T3 cells (Fig. 3). D-hcu reduces luciferase expression by 80% 
when placed downstream from the luciferase reporter (Fig. 3b). 
Nonspecific downregulation effects were ruled out because inversion 
of the same sequence (R-D-hcu) restores luciferase expression levels 


[Figure 2 image: primer-extension gel, lanes 1-7; labels include E-hcu transcription start, DM1-hcu transcription start, 5’ OH and cleaved product.]

Figure 2 | The discontinuous hammerhead ribozyme-containing 3’ UTRs 
(hcu) self-cleave in vitro. Reverse transcriptase primer extensions of in vitro 
transcribed mouse Clec2d (D-hcu, lane 5) and Clec2e (E-hcu, lane 7) 3’ UTRs 
show that both RNAs are cleaved at a single location (open arrowheads). 
D-hcu cleavage site 5’ of U1.1 was identified by dideoxy sequencing of
transcription products. Uncleaved E-hcu and triple mutant DM1-hcu are 
indicated with a black arrowhead. Asterisks indicate non-extended labelled 
primers. 
to that of the positive control. To verify that the reduction of gene
expression was caused by the embedded ribozyme, mutations specif- 
ically targeting hammerhead ribozyme activity were introduced into 
D-hcu. Both a single transversion mutation at position G8 in the 
enzyme fragment (DM2-hcu), previously shown to compromise 
hammerhead ribozyme activity, and a triple mutation in the substrate
fragment (DM1-hcu) restore gene expression to control levels,
indicating that the hammerhead ribozyme embedded in the mouse
Clec2d 3' UTR is responsible for reducing reporter protein expression
in mouse cells. 

RNA from cells transfected with either wild-type (D-hcu) or 
mutant (DM1-hcu) dual luciferase reporter vectors was compared 
using reverse transcription–polymerase chain reaction (RT-PCR) to
assess the effect of the discontinuous ribozyme-containing 3’ UTR on 
mRNA integrity. A mixture of two primers, each recognizing one of 
two reporter genes transcribed from the same vector, was used for 
first-strand complementary DNA synthesis (Fig. 4a). Amplification 
of the region spanning the ribozyme cleavage site indicates that 
Figure 3 | The discontinuous mouse Clec2 hammerhead ribozymes
embedded in the 3’ UTRs downregulate protein expression in vivo. 

a, Layout of the constructs (top) and the schematic of hammerhead 
ribozyme-containing 3’ UTR (hcu) sequences used for expression. The 
colours of substrate (orange) and enzyme (blue) regions correspond to those 
of the structure in Fig. 1 and the insertion is highlighted by a thick line. The 
cleavage sites are indicated by white arrowheads and asterisks denote the 
positions of mutations. CHR (control hammerhead ribozyme) and CHRM 
(G8 to C8 mutant of CHR) are the controls; bp, base pairs. b, In vivo analysis 
of firefly luciferase protein expression from different ‘hcu’ constructs. Black
denotes expression from the vector containing wild type 3’ UTR; grey
represents mutants and controls; and white indicates a vector without an
insert as a negative control. Relative firefly luciferase (hLuc+) expression was
determined using a dual luciferase assay using the Renilla luciferase (hRLuc)
expression for normalization. The results shown are means ± s.e.m. of
triplicate experiments; the error bars for the vector value are averages of 
s.e.m. from three assays. 
mouse Clec2d 3' UTR self-cleaves via the hammerhead ribozyme
activity (Fig. 4b). The uncleaved product of D-hcu is at least tenfold 
reduced from that of DM1-hcu, in accordance with results of the 
protein expression assay (Fig. 4c). Previous studies have shown that 
ribozymes artificially engineered into 3’ UTRs can downregulate 
gene expression by promoting rapid destruction of the transcript
by cytoplasmic RNA degradation machinery. Consistent with these
observations, the catalytic activity of mouse Clec2d 3’ UTR correlates 
with the overall reduction of firefly luciferase transcript levels, as 
compared to those of the mutant UTR-containing mRNA, indicating 
that the cleaved Clec2 transcript is degraded in vivo. 

Structural conservation observed throughout the identified 
sequences, in conjunction with the tight association of the ribozyme 
with Clec2-like genes, suggests that the discontinuous ribozymes are 
orthologous to the mouse hammerhead motif characterized here. 
Within the rodent lineage, the rat CLEC2D11 ribozyme sequence 
shares 86% homology with the mouse Clec2d ribozyme. Similarly, 
two predicted rat Clec2d paralogues are closely (61% and 67%) 
related to the mouse Clec2e ribozyme (Supplementary Fig. 2). 
These sequence relationships positively correlate with phylogenetic 
evidence for multiple duplication events of rodent Clec2 genes.
Further genome searches using separate parts of the hammerhead 
ribozyme detected four additional mouse Clec2 paralogues that con- 
tain substrate-like, but not enzyme, sequences in their 3’ UTRs (data 
not shown). These partial motifs are also positioned proximally to 
[Figure 4 image. Panel b: RT-PCR gel for D-hcu and DM1-hcu at 30, 33 and 36 cycles, showing the uncleaved 643 bp hLuc+ product. Panel c: RT-PCR quantification.]
Figure 4 | RT-PCR analysis of the in vivo expression products. a, Schematic
of the relative positions of the RT-PCR primers. The hammerhead-specific 
cleavage site is indicated by an open arrowhead. The 368 base pair (bp) 
product acts as the internal control for transfection efficiency and the extent 
of transcription. b, RT-PCR of RNA isolated from NIH 3T3 cells that have 
been transfected with pD-hcu or pDM1-hcu. The asterisk indicates 
nonspecific PCR products. c, Quantification of the difference in PCR
product intensity between D-hcu and DM1-hcu. The product intensity was
used to determine the ratio of D-hcu/DM1-hcu at each listed cycle. Each
value represents the mean ± s.d. of intensity ratios from three listed cycles.
the stop codon and share extensive primary sequence identity with
the substrate regions of functional ribozymes of either mouse Clec2d 
or Clec2e. However, it is not clear if the incomplete ribozyme 
sequences are functional substrates for the Clec2d- and Clec2e- 
derived trans-acting ribozyme elements, or are non-functional 
evolutionary relics of the rodent Clec2 gene expansion. 

Identification of CLEC2 ribozymes in the genomes of mammalian 
species other than rodents (notably platypus) indicates that these 
ribozymes share an early mammalian ancestor. Active hammerhead 
motifs have been found in several divergent species, such as newts,
plants and viroids. However, only the Schistosoma satellite DNA is
known to encode hammerhead ribozyme sequences that have significant
homology to the CLEC2 ribozymes. Although schistosomes
parasitize rodents, their ribozymes are found in lineages more modern
than the time of divergence of mouse and rat species, suggesting
a horizontal gene transfer of the hammerhead sequence from host
to the parasite genome as the probable scenario.

In eukaryotes, post-transcriptional regulation of gene expression 
through the 3’ UTR involves an interaction between mRNA-stabilizing
and -destabilizing protein-based processes. Together with
evidence of cross-species secondary structure conservation, our 
results indicate that the CLEC2 hammerhead ribozymes have roles 
analogous to those of destabilizing protein factors. Paired with a yet 
undiscovered antagonistic process, the hammerhead motif may con- 
stitute a system of rapid regulation for CLEC2d, a transmembrane 
C-type lectin that has a dual role: the inhibition of natural killer 
(NK)-cell-mediated lysis through the interaction with the NK cell 
receptor NKR-P1, and the inhibition of osteoclast formation.

The core sequences of CLEC2 ribozymes resemble those of the 
smaller members of its ribozyme class, but the discontinuous 
arrangement of the substrate and enzyme regions exhibits properties
previously associated only with catalytic introns. Although function- 
ing through a fundamentally different mechanism and residing in 
transcripts distinct from processed mRNA, catalytic sequences of 
some self-splicing group I and II introns span long stretches of 
RNA on a scale analogous to that of the ribozymes described here.
A question remains as to whether CLEC2 ribozymes can also process 
substrates encoded on detached transcripts similarly to RNase P, 
which possesses trans-cleaving properties unique among naturally 
occurring ribozymes. Most self-cleaving RNAs, however, are considered
to be compact sequences suitable for single-step functions
such as multimer-to-monomer conversion of genomes in the course 
of viroid and satellite RNA replication, or for response to small 
molecule binding such as in metabolite-dependent gene regulation 
in bacteria. However, in the context of complex gene regulatory 
systems such as those of mammalian cells, the discontinuous config- 
uration of the CLEC2 ribozyme sequences may provide an oppor- 
tunity for allosteric regulation. 


METHODS SUMMARY 


mRNA sequence data from the Ensembl database (release 48, http://www.ensembl.org)
were searched using the RNABOB program with a descriptor
(Supplementary Fig. 1) for the hammerhead ribozyme motif. Orthologues of 
the rodent CLEC2 ribozymes were identified using BLAT and the UCSC Genome 
Browser (http://genome.ucsc.edu/). For in vitro RNA synthesis and analysis,
the Clec2 3' UTR-containing plasmids were linearized and transcribed using T7 
RNA polymerase at a pH of 8.0 and in the presence of 28 mM MgCl₂. Reverse
transcriptase primer extensions and RNA sequencing were adapted for a [³²P]-end-labelled
primer from previously published methods. For the expression
assays, the NIH 3T3 mouse fibroblast cells were transiently transfected with Clec2 
3’ UTR-containing psiCHECK-2 reporter plasmids (Promega) and analysed 
using the Dual-Glo Luciferase Assay System (Promega). Firefly reporter express- 
ion was normalized using expression of the Renilla luciferase encoded on the 
same plasmid. RNA from the transfected cells was purified using Trizol (Sigma). 
For RT-PCR, cDNA was synthesized using primers specific to plasmid-derived 
transcripts using previous methods. PCR was performed with Accuprime Taq
(Invitrogen). PCR products were resolved on an agarose gel and the product 
quantification was performed using ImageQuant 5.2 (Molecular Dynamics). 
Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature. 
METHODS

Motif search and secondary structure. The RNABOB program was used to
search mRNA sequence data from the Ensembl database (release 48,
http://www.ensembl.org) using a descriptor (Supplementary Fig. 1) for the hammerhead
ribozyme motif. The secondary structure was built using information from
the Schistosoma hammerhead ribozyme crystal structure. Mouse and rat Clec2
paralogues encoding partial and full ribozyme sequences were identified using
BLAT and the UCSC Genome browser. Alignment was generated using
ClustalW (ref. 29). Clec2d orthologues were identified using the 30-way Multiz alignment
and conservation tool in the UCSC genome browser. Sequences with
similarity to stem 2 regions of the rodent CLEC2 ribozymes were selected for 
further analysis. These sequences were first screened for completeness of stem 2 
and conserved hammerhead ribozyme nucleotides in the enzyme strand. The 
candidate ribozymes were then selected on the basis of the expected sequence 
(the ability to form stems 1 and 3 together with the enzyme strand) and orienta- 
tion of the substrate within the 2,000 bp regions flanking stem 2. 
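
The idea of searching for a hammerhead motif whose two halves are separated by a long insertion can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions, not the RNABOB descriptor of Supplementary Fig. 1: the enzyme fragment is reduced to a CUGANGA box, a variable stem 2 region and GAAA; stem 1 is required to be a perfect Watson-Crick duplex of fixed length; stem 3 and tolerated mismatches are ignored. The function names, `stem1_len` and the test sequence are hypothetical.

```python
import re

COMPLEMENT = str.maketrans("ACGU", "UGCA")


def revcomp(rna):
    """Reverse complement of an RNA string (Watson-Crick pairs only)."""
    return rna.translate(COMPLEMENT)[::-1]


# Enzyme fragment: CUGANGA box, a variable stem 2 hairpin, then GAAA.
ENZYME = re.compile(r"CUGA[ACGU]GA[ACGU]{6,40}?GAAA")


def find_split_hammerheads(rna, stem1_len=5, max_insert=5000):
    """Return (cleavage_index, enzyme_start) pairs for candidate split motifs.

    cleavage_index is the position of the first nucleotide 3' of the cut.
    """
    hits = []
    for match in ENZYME.finditer(rna):
        enz_start = match.start()
        if enz_start < stem1_len:
            continue
        # The 3' strand of stem 1 sits directly upstream of the CUGA box.
        stem1_3p = rna[enz_start - stem1_len:enz_start]
        stem1_5p = revcomp(stem1_3p)
        window_end = enz_start - stem1_len
        window_start = max(0, window_end - max_insert - stem1_len)
        pos = rna.find(stem1_5p, window_start, window_end)
        while pos != -1:
            # Require a U-H dinucleotide (H = A, C or U) just 5' of stem 1.
            if pos >= 2 and rna[pos - 2] == "U" and rna[pos - 1] in "ACU":
                hits.append((pos, enz_start))
            pos = rna.find(stem1_5p, pos + 1, window_end)
    return hits


# Synthetic test sequence: substrate, a 300-nt insertion, then the enzyme part.
substrate = "AAAAUC" + "GGACU"
insertion = "A" * 300
enzyme = "AGUCC" + "CUGAUGA" + "GGCCGAAAGGCC" + "GAAA" + "CC"
print(find_split_hammerheads(substrate + insertion + enzyme))
```

Lowering `max_insert` below the insertion length makes the same sequence return no hits, which is the behaviour a search restricted to continuous hammerheads would show.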

Constructs. The hammerhead ribozyme-containing 3’ UTR sequences 
(GenBank AF321553 nucleotides 684–1,179 for D-hcu; NM_153506 nucleotides
695-1,710 for E-hcu) were cloned from total DNA (BioChain Institute, Inc.) of 
BALB/c mice. The DM1-hcu construct is the same as D-hcu with the following 
exception: the hammerhead ribozyme has been inactivated by a triple mutation 
spanning the cleavage site (5’-A(16.1)G(17)A(1.1)-3’). The DM2-hcu construct 
contains the hammerhead ribozyme inactivated by a single base (G8 to C8) 
mutation (refer to Fig. 1 for residue identification), with the rest of the sequence 
identical to D-hcu. The R-D-hcu construct contains D-hcu sequence in the anti- 
sense direction. The positive controls CHR (control hammerhead ribozyme) and 
CHRM (G8 to C8 mutant of CHR) were derived from Schistosoma ribozyme and
were oriented analogously to the CLEC2 ribozymes—with stem 3 connecting the 
structure to the rest of the transcript. Each of the two controls was capped with an 
artificial tetraloop closing stem 1 (full CHR sequence with tetraloop underlined 
and the CHRM mutation position preceded by an asterisk: 5’-CCGGCGT 
CCTGGTATCCAATCCTTCGGGATGTACTACCAGCTGAT*GAGTCCCAAA 
TAGGACGAAACGCCGG-3’). For in vitro transcription assays, 3’ UTR 
sequences were cloned into the XbaI site of the pET-20(b) vector (Novagen).
For expression assays, 3’ UTR sequences and control ribozymes were placed 7
nucleotides downstream of the firefly luciferase open reading frame (XbaI site) in
psiCHECK-2 dual luciferase vector (Promega). Cognate poly A sites have been 
omitted from all 3’ UTR sequences and replaced with vector-encoded SV40 
Poly(A) signal. The psiCHECK-2 vector also contained a Renilla luciferase gene, 
the expression product of which was used as an internal control for transfection
efficiency and the extent of expression. 
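
The CHR sequence given above can be sanity-checked with a short script. The sketch below joins the three printed lines of the sequence, derives CHRM by changing the base that follows the asterisk from G to C, and checks that the 5’ and 3’ ends are reverse complements of each other, as expected if they pair to form stem 3. Treating TTCG as the tetraloop is an assumption, because the underline mentioned in the text did not survive extraction; all variable names are illustrative.

```python
# CHR sequence as printed, with the asterisk marking the CHRM mutation site.
CHR_ANNOTATED = (
    "CCGGCGT"
    "CCTGGTATCCAATCCTTCGGGATGTACTACCAGCTGAT*GAGTCCCAAA"
    "TAGGACGAAACGCCGG"
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(dna):
    """Reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


# The asterisk precedes the mutated base, so its index is the G8 position.
g8_index = CHR_ANNOTATED.index("*")
chr_seq = CHR_ANNOTATED.replace("*", "")
chrm_seq = chr_seq[:g8_index] + "C" + chr_seq[g8_index + 1:]

# Stem 3: the first and last seven bases should be reverse complements.
stem3_pairs = chr_seq[:7] == revcomp(chr_seq[-7:])

# Assumed tetraloop (TTCG) and the four bases flanking it on each side.
loop = chr_seq.find("TTCG")
left, right = chr_seq[loop - 4:loop], chr_seq[loop + 4:loop + 8]

print("wild-type base at mutation site:", chr_seq[g8_index])
print("stem 3 ends complementary:", stem3_pairs)
print("tetraloop flanks complementary:", left == revcomp(right))
```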

In vitro transcription and reverse transcriptase primer extension. pET-20(b) 
constructs were linearized and transcribed for 1 h at 37 °C using T7 RNAP in the 
following conditions: 50–60 µM double-stranded plasmid, 5 mM ribonucleotide
triphosphates, 28 mM MgCl₂, 2 mM spermidine-HCl, 100 mM dithiothreitol,
0.01% Triton X-100 and 40 mM Tris-HCl buffer (pH 8.0). Transcripts were 
purified using phenol and chloroform after the transcription reaction and then 
again after the treatment with RQ-DNase (Promega). Reverse transcriptase primer
extensions and RNA sequencing were adapted for a [³²P]-end-labelled
primer from previously published methods.

Reporter expression assays. NIH 3T3 mouse fibroblast cells were transiently 
transfected using Lipofectamine LTX (Invitrogen) according to the manufac- 
turer’s instructions and assayed using the Dual-Glo Luciferase Assay System 
(Promega). The firefly/Renilla luciferase expression ratios were normalized to 
those of the psiCHECK-2 vector. 
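
The normalization described above amounts to dividing each firefly reading by the Renilla reading from the same well and then expressing that ratio relative to the empty psiCHECK-2 vector. A minimal sketch follows; the readings are invented for illustration, and the function names are not from the paper.

```python
from math import sqrt
from statistics import mean, stdev


def relative_expression(firefly, renilla, vector_ratio):
    """Firefly/Renilla ratio per well, expressed relative to the empty vector."""
    return [(f / r) / vector_ratio for f, r in zip(firefly, renilla)]


def mean_sem(values):
    """Mean and standard error of the mean for replicate measurements."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical luminometer readings (arbitrary units), three replicates each.
vector_ratio = mean(f / r for f, r in zip([900.0, 1000.0, 1100.0],
                                           [100.0, 100.0, 100.0]))
d_hcu = relative_expression([190.0, 200.0, 210.0], [100.0, 100.0, 100.0],
                            vector_ratio)
avg, sem = mean_sem(d_hcu)
print(f"D-hcu: {avg:.2f} +/- {sem:.2f} of vector")
```

With these made-up numbers the construct reads out at about 0.20 of the vector value, the same order as the 80% reduction reported for D-hcu in the main text.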

RT-PCR. NIH 3T3 cells were transfected with mouse Clec2d 3’ UTR-containing 
psiCHECK-2 plasmids in 10-cm plates. Whole-cell RNA was purified using 
Trizol (Sigma), treated with RQ-DNase (Promega), and purified with phenol 
and chloroform. cDNA was synthesized using two primers specific to two different
(experimental and control) plasmid-derived transcripts using previous
methods. Primer sequences for cDNA synthesis were
5’-CATGTCTGCTCGAAGCGGC-3’ for the ribozyme-containing firefly luciferase
(hLuc+) transcript, and 5’-TCCTCAGGCTCCAGTTTCC-3’ for the control
Renilla luciferase (hRLuc) transcript. Primer extension products were treated 
with RNase A and RNase H, purified with phenol and chloroform, and re- 
suspended in water. For PCR reactions, Accuprime Taq along with Buffer II 
(Invitrogen) were used according to manufacturer’s instructions. For PCR of 
the ribozyme-containing hLuc+, the hLuc+ cDNA primer was used together with
5'-GTGGACGAGGTGCCCAAGG-3’ (forward) for the 643 nucleotide product, 
and 5'-GCTACTCTTTCCCCTATGGTC-3’ (forward) for the 494 nucleotide 
product. For PCR of the control hRLuc, the hRLuc cDNA primer was used 
together with 5'-CTGATCTGATCGGAATGGG-3’ (forward). PCR reactions
were placed on ice after the designated number of cycles, resolved on a 2% 
agarose gel and stained with ethidium bromide. PCR product intensities were 
quantified using ImageQuant 5.2 (Molecular Dynamics). 


29. Larkin, M. A. et al. Clustal W and Clustal X version 2.0. Bioinformatics 23, 
2947-2948 (2007). 


©2008 Macmillan Publishers Limited. All rights reserved 


Vol 454 | 14 August 2008 | doi:10.1038/nature07162


nature 


LETTERS 


Dynamic thiolation-thioesterase structure of a 
non-ribosomal peptide synthetase 


Dominique P. Frueh, Haribabu Arthanari, Alexander Koglin, David A. Vosburg, Andrew E. Bennett,
Christopher T. Walsh & Gerhard Wagner


Non-ribosomal peptide synthetases (NRPS) and polyketide
synthases (PKS) produce numerous secondary metabolites with 
various therapeutic/antibiotic properties. Like fatty acid synthases
(FAS), these enzymes are organized in modular assembly lines in 
which each module, made of conserved domains, incorporates a 
given monomer unit into the growing chain. Knowledge about 
domain or module interactions may enable reengineering of this 
assembly line enzymatic organization and open avenues for the 
design of new bioactive compounds with improved therapeutic 
properties. So far, little structural information has been available 
on how the domains interact and communicate. This may be 
because of inherent interdomain mobility hindering crystallization, 
or because crystallized molecules may not represent the active 
domain orientations. In solution, the large size and internal
dynamics of multidomain fragments (>35 kilodaltons) make struc- 
ture determination by nuclear magnetic resonance a challenge and 
require advanced technologies. Here we present the solution structure
of the apo-thiolation–thioesterase (T-TE) di-domain fragment
of the Escherichia coli enterobactin synthetase EntF NRPS subunit. 
In the holoenzyme, the T domain carries the growing chain tethered 
to a 4’-phosphopantetheine whereas the TE domain catalyses 
hydrolysis and cyclization of the iron chelator enterobactin. The 
T-TE di-domain forms a compact but dynamic structure with a 
well-defined domain interface; the two active sites are at a suitable 
distance for substrate transfer from T to TE. We observe extensive 
interdomain and intradomain motions for well-defined regions and 
show that these are modulated by interactions with proteins that 
participate in the biosynthesis. The T-TE interaction described here
provides a model for NRPS, PKS and FAS function in general as 
T-TE-like di-domains typically catalyse the last step in numerous 
assembly-line chain-termination machineries. 

The Escherichia coli enterobactin synthetase (Ent) synthesizes the 
siderophore enterobactin, a virulence factor used by E. coli to infect 
iron-limited microenvironments of vertebrate hosts. Enterobactin is 
obtained by three iterative condensations of dihydroxybenzoate 
(DHB) with serine, followed by macrocyclizing release of the macro- 
lactone that can form high-affinity hexadentate complexes with ferric 
iron (Fig. 1). Ent is a two-module (EntB/EntE and EntF), three-protein
assembly line that combines the features of both a type I
NRPS, in which all domains interact in cis in a single protein chain 
(EntF), and a type II NRPS, in which the domains interact in trans as 
free-standing units (EntB/EntE). The dedicated phosphopantethei- 
nyl transferase (PPTase), EntD, loads the apo form of the T domain 
with 4’-phosphopantetheine (4’-PP), which tethers the growing acyl 
chain while allowing the bound substrates to navigate between the 
catalytic sites (Fig. 1a). The EntE and EntF adenylation (A) domains
select the appropriate substrates (DHB or serine) and activate them 
as acyl-adenosine-monophosphates. These are then loaded on the 4’- 
PP arms attached to Ser 245 of the holo-EntB aryl carrier protein 
(ArCP) and to Ser 48 of holo-EntF T domains, respectively
(Fig. 1b). The EntF condensation (C) domain catalyses the DHB–Ser
amide-bond formation (Fig. 1c). The DHB-Ser chains are next
transferred to the active site Ser 180 of the thioesterase (TE) domain, 
thereby making the 4’-PP arm of the T domain available for the next 
cycle of condensation (Fig. 1d). In this step of the reaction, the 4’-PP 
arm (wavy line in Fig. 1d) has to reach from Ser 48 of the T domain to 
Ser 180 of the TE domain, in which the triad Ser 180, Asp 207 and 
His 313 catalyses the oxoester bond formation between the substrate 
and Ser 180. The steps shown in Fig. 1b-d are repeated twice to 
produce the tethered linear DHB-Ser trimer before its intramolecu- 
lar cyclization by the TE domain of EntF, which constitutes the 
release of the enterobactin trilactone (Fig. 1e). The structural basis
of communication between the T and TE domains in chain-release 
steps at the end of NRPS assembly lines has been so far unknown. 

Here we describe the structure of the 37 kDa apo-EntF T-TE di- 
domain in which the active site Ser 48 (of the T domain) was replaced 
with Ala to enable production of a homogeneous sample. This mutation
is isomorphous as the NMR spectra of the T-TE di-domain are
unchanged except for the immediate environment of residue 48,
and the interdomain contacts are identical (see Methods and 
Supplementary Figs 1 and 2). It represents the physiological structure 
before CoA addition as indicated at the top of Fig. 1a. The mutated
apoprotein allows us to distinguish domain interactions from 
domain/substrate contributions (including the 4’-PP arm), knowledge that is crucial
for successful assembly line reprogramming by domain
swap. The T and TE domains have a well-defined relative orientation 
with a primarily hydrophobic interface (Fig. 2a, c). This results in a 
1,300 Å² buried surface area not including the linker, which interacts
with both domains and contributes a further 200 Å². The globular
T domain forms a three-helical bundle that is wedged between the
globular core of the thioesterase and two α-helices (α4TE–α5TE) protruding
from this core. These helices resemble two webbed fingers
that form a lid covering the T and TE active sites. Ser48Ala and
Ser 180 are found 17 Å apart at both ends of a canyon formed by loop
L1TE and helix α4TE. This allows the 4'-PP arm, which is tethered to
Ser 48 of the T domain in the holoprotein and can span up to ~20 Å,
to reach Ser 180 of the TE domain. Dynamics data indicate that the 
two helices (α4TE–α5TE) form a mobile flap that is able to open to
accommodate the 4’-PP arm (see below). Thus, the structure should 
not be interpreted as rigid but rapidly fluctuating to allow loading of 
the 4’-PP arm and the growing enterobactin chain. This resembles 
many other proteins that have to undergo major opening fluctua- 
tions to allow binding of target proteins or substrates, such as HIV 
protease’. 
The T domain contacts TE via the beginning of helix α2T that
contains Ser48Ala, the end of loop L3T and the single-turn helix
α1'T (Fig. 2c). The region near Ser48Ala interacts with both the core
of the TE domain, via L2TE, and the tips of the helical fingers, which
thus cover the T active site (Fig. 2b). In EntF, helix α1'T stabilizes the
hydrophobic core of the T domain with a phenylalanine side chain 
(Phe 41) anchored in the heart of the helical bundle. An adjacent 
phenylalanine (Phe 42) interacts with hydrophobic residues of β1TE.
Point mutations of either phenylalanine lead to a disruption of the
T/TE domain interaction and an unfolded T domain (data not
shown). α1'T is absent in the only other structure of an (excised) T
domain®’, but has been observed in related FAS and PKS acyl carrier 
proteins (ACP) and may contribute to domain recognition*”° (see 
Supplementary Information). Loop L3T, which interacts with β1TE,
features helical characteristics and is mobile as indicated by the low 
intensities of the corresponding NMR signals (Supplementary Fig. 8). 




Figure 1| Summary of enterobactin synthesis. The grey, red and blue boxes 
represent individual domains; the C, A, T and TE domains are described in 
the text. The EntB ICL domain is an isochorismate ligase that participates in 
the conversion of chorismate to DHB, a preliminary step to the synthesis of 
enterobactin. Initially, EntD loads the phosphopantetheinyl arm (4'-PP, 
represented as a wavy line) onto the active-site serines of apo-EntB and apo- 
EntF. The synthesis steps (a–e) are described in the text.
In general, the T domain shows internal mobility as further
supported by fast NH exchange (Supplementary Fig. 9). Such 
dynamics have been observed for single-domain ACP-type T 
domains in FAS systems'', and may be a general property of acyl 
and peptidyl carrier proteins (see Supplementary Information). 
The TE domain adopts the overall fold of α/β-hydrolases.
However, the ‘finger’ region differs in the two systems for which
TE structures have been determined’®”’. Helix α4TE faces the catalytic
triad, located in a conserved region, and delimits part of the ‘cycliza- 
tion bucket’ previously described for substrate macrocyclization in 
surfactin type I thioesterase’* (TEI; Fig. 2b, grey and white). For 
surfactin TEI, two conformers have been observed for this helix!’, 
whereas the electron density was too low in fengycin TEI to allow
any structural interpretation’® (see Supplementary Information). 
Moreover, NMR studies of the free standing surfactin type II thio- 
esterase (TEII) indicated two conformations in the corresponding 
lid. Accordingly, many residues in this region of the EntF TE
domain show reduced intensities in their NMR signals, which is 
indicative of slow modulations of their environments 
(Supplementary Fig. 8). Furthermore, unlike the core of the TE 
domain, all amide protons of the finger region exchange rapidly with 


Figure 2 | Structure of the EntF T-TE fragment. a, A ribbon diagram is 
shown. The T domain (red) is wedged between the core (blue) and the lid 
(green) of the TE domain. Active sites are shown as yellow spheres. The 
double-headed arrow emphasizes that the flap formed by the α4TE and α5TE
helices is relatively mobile, opening frequently, which seems to be necessary 
to accommodate the 4’-PP arm in the processes depicted in Fig. 1. b, Surface 
representation of the region containing the canyon (grey) which must open 
to accommodate the 4’-PP arm in the holoprotein. The cyclization bucket'” 
is shown in white and grey. In this conformation, Ser48Ala and Ser 180 are 
buried. c, The domain interface is shown. The side chains of all residues 
giving rise to interdomain nuclear Overhauser effects (NOEs) are shown. 
solvent (Supplementary Fig. 9), indicative of breathing/opening
motions. Such mobility may be necessary to allow for access to both 
T and TE active sites, which would be occluded in a more rigid 
canyon structure (see later). The mobility also provides conforma- 
tional plasticity to the TE active site to accommodate the various 
intermediates stored on Ser 180 and allow the 4'-PP arm to navigate 
in and out of the canyon in the holoprotein, which would not be 
possible in a static interpretation of our solution structure. 

In addition to the intradomain dynamics reported above, we 
observed an open form of the T-TE molecule, T-TE°, in which TE
and T interact little or not at all (see Supplementary Information and 
Fig. 3). Indeed, although ACP-type T domains have recently been 
observed in FAS’*", they have previously been reported to be invis- 
ible in the crystal structures of FAS or PKS due to low electron 
density’, in agreement with dynamic domain interactions. To 
investigate the significance of these dynamic events, the T-TE frag- 
ment was titrated with binding partners that participate in different 
steps of enterobactin synthesis. To become functional as carrier pro- 
teins, T domains first need to be primed with 4’-PP arms by a PPTase 
(Fig. 1a). When the broad-specificity PPTase Sfp’? was added to a
solution of T-TE, a binding site was identified around the active-site
residue 48, as expected® (Fig. 3a, green). In addition, however, the 
subset of signals of the T-TE° conformer increased, suggesting that 
Sfp drives the equilibrium towards the dislodged form of the di- 
domain (Fig. 3). This is reasonable because Ser48Ala is partially 
covered by the tips of the fingers in the di-domain (Fig. 2b). The 
Figure 3 | Interaction with Sfp. a, Residues subject to two different
environments. The colour code represents small (blue), medium (red) and 
large (yellow) chemical shift differences between the two forms as reported 
in b. c, Chemical shift differences between the major conformer of free T-TE 
and the excised TE domain. d-f, Spectral signatures of the Sfp interaction. 
d, The reference spectrum of Ser48Ala. The open form is only visible with 
longer acquisitions (Supplementary Fig. 3). On addition of Sfp, the signals of 
the residues in green (see a) disappear from the spectrum, and those of the 
second form appear (e). f, Corresponding region of the spectrum of the 
excised TE. 
simultaneous observation of new signals and shifts of resonances
(Supplementary Fig. 11) indicates the presence of complex dynamic 
processes involved in the function of this enzyme. The EntF-specific 
EntD enzyme” caused similar selection of the open state; in contrast, 
the ACP-specific AcpS PPTase”’ did not (Supplementary Fig. 12), 
highlighting the specificity of the mechanism. 

The EntF C domain” condenses the substrates that are covalently 
attached as thioesters to 4’-PP arms on the EntB and EntF T domains 
(Fig. 1c). Specific chemical shift changes on addition of the EntF C 
domain to T-TE showed the C-domain-binding face on T (Fig. 4),
and only minor effects were observed on TE. Because the T/C surface 
does not overlap with the TE interaction surface, no disruption of the 
T/TE interface is necessary and only a modulation of their interaction 
is observed. Notably, no interaction was observed when the TycC5 
(the fifth module of the tyrocidine synthetase C subunit) C domain 
was added to a T-TE solution, suggesting that this T-C recognition is
EntF specific (see Supplementary Fig. 13). For both C-domain and 
Sfp interactions, perturbations on TE are not limited to the interface, 
but affect the finger region and the TE active site, showing that the 
binding events probably influence the dynamics and/or the confor- 
mations in these areas. These observations apply to the apo form of 
the molecule and thus only report on domain-domain interactions.
It is probable that during biosynthesis the 4’-PP arm, loaded with 
various intermediates, also affects these dynamic effects. 

The structure of the T-TE di-domain provides a starting point for 
elucidating the communication between domains in NRPS systems. 
Understanding how the central carrier domains interact with their 
partner catalytic domains is a prerequisite for understanding the 
NRPS assembly logic in chain initiation, elongation and termination 
steps. Our data document a well-defined, yet dynamic, interface 
between the T and TE domains, consistent with the idea that the T 
domain must interact with the upstream C and A domains as well as 
Figure 4 | Interaction with the C domain. Many resonances experience
shifts on addition of the C domain (bottom), indicating a modulation of the 
environment of the corresponding residues. Those belonging to the T 
domain form an interaction surface that does not overlap with the T-TE 
interface (top). Thus, no disruption of the T-TE interaction is observed. The
effects on the TE domain may be due to weak, indirect interactions with the 
C domain, or to a secondary effect due to fluctuations of the dynamics or 
modifications of the conformations in these regions. Residues with shifts 
larger than one s.d. from the mean (red line, bottom) are colour coded with a 
gradation from white to red. 
this downstream TE domain. The PPTase and C domain, both of
which must visit the T domain, modulate the interaction between the 
T and TE domains as well as the conformational and/or dynamic 
properties of each domain. Notably, the in trans interaction between 
surfactin TEII and the TycC3 T domain uses a different domain 
orientation but places the two active sites at a comparable distance’. 
Moreover, the in trans domain interaction selects subsets of confor- 
mational states similarly to the in cis situation described here. NRPS 
domains need to communicate with each other at specific points 
during the peptide synthesis, hence natural product assembly lines 
must be dynamic entities. The observed malleability may serve to 
regulate this interplay by stabilizing a given interaction during the 
course of the corresponding reaction. 


METHODS SUMMARY 


Recombinant protein was produced in an E. coli expression system. The Ser48Ala
mutation was used because it was difficult to produce exclusively holo- or apo- 
T-TE, rather than a mixture, in bacterial expression systems tested. Moreover, 
wild-type apo-T-TE exists in two inter-converting conformational states for the 
T domain (data not shown); a similar phenomenon is observed for the apo-T 
domain of tyrocidine synthetase, TycC3, in which the Ser48Ala mutation locks the
protein in a single conformation’. Out of the 337 backbone resonances and the 
associated side-chain resonances, 313 were assigned with transverse relaxation 
optimized spectroscopy (TROSY) versions of conventional experiments” some- 
times with non-uniform sampling™*. A double TROSY hNcaNH was recorded to 
facilitate the assignment**. Most distance restraints were obtained from time- 
shared nuclear Overhauser enhancement spectroscopy (NOESY) experiments”*. 
High-resolution structures were obtained with the program CYANA” (backbone
root mean square deviations (r.m.s.d.) of 1.2 Å for residues in secondary
structure elements, with corresponding r.m.s.d. for the single domains at 0.6 Å
(T domain) and 1.0 Å (TE domain; Fig. 2 and Supplementary Fig. 3)). The
presence of complex dynamic processes prevented the use of residual dipolar
couplings**”’. Titrations were conducted with a constant T-TE concentration 
and with decreasing concentration of the titrant, to ensure that no degradation 
was induced by the titration. Solvent-accessible residues were
determined by changing the solvent from H₂O to D₂O with a PD10 column
(GE Healthcare).


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Protein expression and purification. A pET30a+ plasmid containing the EntF 
T-TE gene, with a C-terminal His₆ tag, was transformed into E. coli BL21(DE3)
cells. Isotope-enriched protonated samples were produced by overexpression
in M9 minimal media in H₂O containing ¹⁵NH₄Cl or ¹³C glucose or both.
Deuterated samples (including methyl-labelled samples) were prepared in D₂O
containing ²H-¹³C glucose. Partially deuterated samples were prepared in recycled
D₂O with ¹³C glucose. ‘ILV methyl-labelled’ samples, which are selectively protonated
on methyls of Leu, Val and Ile (δ1), were prepared as described in*®”, with
α-ketobutyrate and α-ketoisovalerate precursors either uniformly enriched in
¹³C, for assignments of side-chain signals, or only ¹³C enriched at the methyls
of the precursors, for the NOE experiments. Samples selectively enriched in ¹⁵N
for a given amino acid were prepared by supplying the medium with a single ¹⁵N-
enriched amino acid and 19 unlabelled amino acids’. The growth times were 4 h
in H₂O, 6 h in recycled D₂O and 12 h in D₂O. Induction times were 2 h in H₂O, 4 h
in recycled D₂O and 6 h in D₂O. For all samples, the cells were lysed by sonication
and the protein was purified using Ni-NTA resin (Qiagen) followed by FPLC 
using a Sephadex gel-filtration column (S75 or S200). The final sample concentrations
were approximately 300 µM in 20 mM phosphate buffer (pH = 6.7) with
150 mM NaCl, and 1 mM EDTA and dithiothreitol. 

Assignment of backbone resonances. TROSY versions of backbone experiments”***"*
were used in a 70% deuterated ²H-¹⁵N-¹³C sample. This led to the
assignment of 50% of the signals, 30% of which were assigned with the automated
assignment program IBIS**. HN-TROSY-HSQC” spectra were recorded on samples
selectively enriched in ¹⁵N for Ile, Leu, Val, Lys, Phe, Tyr and Gly (which partially
scrambles into Ser). These spectra were combined with the ¹H, ¹⁵N projections of a
CCONH spectrum of an ILV methyl-labelled sample and with the ¹H, ¹⁵N projections
of an HNCOCACB spectrum. These combinations enabled the identification of Ile,
Leu, Val, Lys, Phe, Tyr, Gly and Ser residues that were neighbours of Ile, Leu, Val and
Gly residues. Similarly, a non-uniformly sampled** HNCO spectrum was recorded
on a sample uniformly labelled in ¹⁵N and selectively labelled on carbonyl carbons of
prolines. In addition, a non-uniformly sampled double-TROSY-hNcaNH experiment”
was designed to correlate triplets of amide nitrogens belonging to contiguous
residues. All of the information together led to 70% of the backbone assignment. A
different strategy was used to assign residues with non-exchangeable amide protons,
which constitute 30% of the TE domain. Non-uniform sampling was used to record
high-resolution HNCO, HNCA and HNCOCA spectra of a ¹⁵N-¹³C-labelled sample
at 600 MHz with a good signal-to-noise ratio in spite of the fast relaxation due to the
sample protonation. A ¹³C-detected MQ-HACACO™ was then used to assign Hα
signals. These experiments were used together with a ¹⁵N-dispersed NOESY-
TROSY-HSQC leading to the assignment of 93% of the backbone resonances
(313 out of 337 residues).

Assignment of side-chain resonances. HCCONH and CCONH” experiments 
were recorded on a 70% deuterated, uniformly ¹⁵N- and ¹³C-enriched sample.
These experiments were also recorded on an ILV-methyl-labelled sample together
with HCCH-total correlation spectroscopy (TOCSY)**. An HCCH-COSY” was
recorded on a ¹⁵N-¹³C-enriched sample to assign the signals of residues with
non-exchangeable amide protons. Spin systems obtained with this experiment 
were related to backbone assignments by the MQ-HACACO experiment. 
Further assignments were obtained with NOESY spectra described below, after 
initial structure calculations. 

Measurement of NMR constraints. Torsion angle constraints were obtained
with the program TALOS”. Most of the distance constraints were measured with
time-shared three-dimensional (3D) TS-NOESY-HN-TROSY/NOESY-HC-
PEP-HSQC and 4D HN-TROSY/HC-HSQC-NOESY-HN-TROSY/HC-PEP-
HSQC experiments”® with a mixing time (τm) of 200 ms, recorded on an ILV-
methyl-labelled sample that also contained protonated phenylalanine"'. A time-
shared HSQC-NOESY (D.P.F., unpublished data), with high resolution for the
dimension featuring the cross-peaks, was recorded to provide accurate integration
of the NOE signals. A NOESY-HN-TROSY spectrum (τm = 90 ms) was
recorded on a protonated sample to measure restraints involving non-exchange- 
able amide protons. 

NMR data acquisition. All experiments were conducted on Bruker spectro- 
meters equipped with cryoprobes. For the assignment of backbone resonances, 
experiments were recorded on a 900 MHz spectrometer, except for those involving
¹³C-¹⁵N samples and selectively labelled samples, which were measured at
600 MHz, and the MQ-HACACO experiment, which was recorded on a 
500 MHz spectrometer equipped with a TXI cryoprobe. Side-chain experiments 
were measured at 600 MHz, except for the HCCH-COSY, which was recorded at 
900 MHz. NOESY spectra were recorded at 750 MHz (3D and 4D time-shared 
experiments) and 900 MHz (3D time-shared HSQC-NOESY and 3D NOESY- 
TROSY). All spectra were processed with nmrPipe* and analysed with CARA”. 
Structure calculation. The structures of each domain were first solved individu-
ally, with constraints measured on the T-TE fragment. For the T domain, a 
homology model with the TycC3 T domain (PDB code 1DNY”*) could be produced“*.
For the TE domain, sequential alignment with the surfactin TEI (1JMK’”)
provided erroneous results. However, we could use secondary structure align- 
ment, on the basis of TALOS predictions and NOE patterns, to produce a model 
with the program DeepView***’. The program CYANA” was used to predict 
NOEs on the basis of these models. The quality of the model could immediately 
be inspected in the various NOESY spectra and numerous corrections were made, 
in particular with help from 4D spectra. This resulted in a first low-resolution 
structure (r.m.s.d. of about 4 Å in each domain and 10 Å for the T-TE fragment).
This structure was then used to identify new distance constraints and to make 
further corrections. This was repeated for each new structure. One-hundred-and-eighty-one
hydrogen bond constraints were added for residues in secondary structured
regions, once the local r.m.s.d. were shown to be under 0.5 Å, and 497 torsion
angle constraints were included when they supported the existing structure. In the 
end, 4,383 distance constraints (1,163 intraresidues, 2,546 medium range and 935 
long range) with 34 interdomain distance constraints were used to produce a 
structure with a global r.m.s.d. of 2.2 Å. The structural refinement in explicit water
was performed using the RECOORD* scripts, after converting the CYANA 
derived structures into the CNS/XPLOR” format. 
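
The r.m.s.d. values quoted above summarize how tightly an ensemble of calculated conformers clusters around its mean. A minimal sketch of that calculation follows. It assumes the conformers have already been superimposed and that each is a list of backbone atom coordinates in Å; the function names and the toy coordinates are hypothetical and are not taken from CYANA or RECOORD.

```python
import math

Coord = tuple[float, float, float]


def mean_structure(ensemble: list[list[Coord]]) -> list[Coord]:
    """Average each atom position over all (pre-superimposed) conformers."""
    n_conf = len(ensemble)
    n_atoms = len(ensemble[0])
    return [
        tuple(sum(conf[i][k] for conf in ensemble) / n_conf for k in range(3))
        for i in range(n_atoms)
    ]


def rmsd(a: list[Coord], b: list[Coord]) -> float:
    """Root mean square deviation between two equally long coordinate lists."""
    squared = sum(
        (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2 + (p[2] - q[2]) ** 2
        for p, q in zip(a, b)
    )
    return math.sqrt(squared / len(a))


def ensemble_rmsd(ensemble: list[list[Coord]]) -> float:
    """Mean r.m.s.d. of the conformers to the mean structure."""
    reference = mean_structure(ensemble)
    return sum(rmsd(conf, reference) for conf in ensemble) / len(ensemble)


# Hypothetical two-conformer, two-atom ensemble, offset by 2 Å along z.
toy = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)],
]
print(ensemble_rmsd(toy))
```

In practice the conformers would first be superimposed on the selected residues (for example, those in secondary structure elements), which this sketch deliberately leaves out.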

Titrations. See Supplementary Information. The reported chemical shift differences
were calculated by
$$\Delta\delta(^{1}\mathrm{H}, ^{15}\mathrm{N}) = \left((\Delta\omega_{\mathrm{H}})^{2} + \left(\tfrac{1}{10}\Delta\omega_{\mathrm{N}}\right)^{2}\right)^{1/2},$$
where $\Delta\omega_{i}$ is the chemical shift difference between the two species for nucleus i. A red
dashed line on the resulting figures indicates a threshold of 1 s.d. that was used to
single out outliers.
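
The combined chemical shift difference and the 1 s.d. outlier threshold described above can be sketched in a few lines. This is an illustration only: the function names and the example shift values are hypothetical, and the choice of the population standard deviation added to the mean is an assumption about how the threshold was applied.

```python
import math
from statistics import mean, pstdev


def combined_shift(d_h: float, d_n: float, n_scale: float = 0.1) -> float:
    """Combined 1H/15N shift difference, with the 15N term scaled by 1/10."""
    return math.sqrt(d_h ** 2 + (n_scale * d_n) ** 2)


def flag_outliers(shifts: dict[int, tuple[float, float]]) -> list[int]:
    """Return residue numbers whose combined shift exceeds mean + 1 s.d."""
    combined = {res: combined_shift(dh, dn) for res, (dh, dn) in shifts.items()}
    values = list(combined.values())
    # Assumption: threshold is the mean plus one population standard deviation.
    threshold = mean(values) + pstdev(values)
    return sorted(res for res, value in combined.items() if value > threshold)


# Hypothetical (delta 1H, delta 15N) differences in p.p.m., keyed by residue.
example = {
    45: (0.01, 0.05),
    46: (0.02, 0.10),
    47: (0.03, 0.00),
    48: (0.30, 1.00),
    49: (0.01, 0.10),
}
print(flag_outliers(example))
```

With these made-up values only residue 48 stands out, which mirrors how the red threshold line is used to pick out perturbed residues in the titration figures.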
Structural basis for the selectivity of the external
thioesterase of the surfactin synthetase 
Mohammad R. Mofid, Peter Güntert, Gerhard Wagner, Christopher T. Walsh, Mohamed A. Marahiel
& Volker Dötsch


Non-ribosomal peptide synthetases (NRPS) and polyketide
synthases (PKS) found in bacteria, fungi and plants use two dif- 
ferent types of thioesterases for the production of highly active 
biological compounds’’. Type I thioesterases (TEI) catalyse the 
release step from the assembly line’ of the final product where it 
is transported from one reaction centre to the next as a thioester 
linked to a 4'-phosphopantetheine (4’-PP) cofactor that is cova- 
lently attached to thiolation (T) domains*”. The second enzyme 
involved in the synthesis of these secondary metabolites, the type 
II thioesterase (TEII), is a crucial repair enzyme for the regeneration
of functional 4’-PP cofactors of holo-T domains of NRPS and
PKS systems'®’*. Mispriming of 4’-PP cofactors by acetyl- and 
short-chain acyl-residues interrupts the biosynthetic system. 
This repair reaction is very important, because roughly 80% of 
CoA, the precursor of the 4’-PP cofactor, is acetylated in bacteria’. 
Here we report the three-dimensional structure of a type II thioes- 
terase from Bacillus subtilis free and in complex with a T domain. 
Comparison with structures of TEI enzymes*"™* shows the basis for 
substrate selectivity and the different modes of interaction of TEII 
and TEI enzymes with T domains. Furthermore, we show that the 
TEII enzyme exists in several conformations of which only one is 
selected on interaction with its native substrate, a modified holo-T 
domain. 
The cyclic lipoheptapeptide surfactin is one of the most potent bio- 
surfactants showing antibacterial and antiviral activity**””. It is syn- 
thesized by a complex of three large surfactin synthetase subunits that 
consist of either three modules (subunits SrfA-A and SrfA-B) or one 
module (subunit SrfA-C; Fig. 1), with each module being responsible 
for the addition of one amino acid’’. The transport of the growing 
chain between individual modules is achieved by small ~80-amino- 
acid-long T domains that interact with the adenylation (A) domain 
and both the upstream and downstream peptide-bond-forming con- 
densation (C) domains*®*'®. In addition, some of the T domains of 
the surfactin synthetase interact with epimerization domains, located 
at the carboxy termini of the subunits SrfA-A and SrfA-B, or with the 
covalently linked TEI’. The TEI catalyses the macrolactone formation 
between Leu7 and the β-hydroxy fatty acid to release the mature
surfactin. Modifications blocking the reactive thiol group of the 4’- 
PP cofactor attached to any T domain can occur with small molecules 
present in the cell (acetylation, succinylation and modification with 
fatty acids) and pose a challenge for the organism to keep the assem- 
bly line running. 

The surveillance and repair tasks for the surfactin assembly line are 
carried out by the stand-alone surfactin type II thioesterase
(SrfTEII)'°"*. The importance of this enzyme has been demonstrated
by genetic deletions that reduce the production of surfactin by 84%”. 
Owing to the large variety of acylation modifications and the fact that 
SrfTEII has to be able to interact with all seven T domains of the entire 
assembly line, this TEII has to be—in contrast to the TEI at the end of 
the last module—rather nonspecific. At the same time, premature 
cleavage of the correctly growing peptide chain has to be avoided. In 
addition to this repair function, SrfTEII might also be responsible for 
loading the β-hydroxy fatty acid onto the first C domain.

SrfTEII shows the typical α/β-hydrolase fold, with a central
7-stranded β-sheet surrounded by 8 helices (Fig. 2). Comparison
with the structure of the cis-acting SrfTEI shows a similar overall 
architecture*, but with notable differences (Supplementary Fig. 5). 
The most obvious alteration is the amino-terminal truncation of
α-helix (Asp 60–Leu 67). Further modifications include the insertion
of an extra helix (helix 7; Cys 193–Trp 200) between the active site
residues Asp 189 and His 216, and repositioning of the helix-turn-helix
motif canopying the active site in SrfTEI. This lid is moved from
its central position observed in the SrfTEI crystal structure* towards
the β-strands 6 and 7 in SrfTEII. Consequently, the active site residues
in the SrfTEII structure (Ser 86, Asp 189 and His 216)'°"' are just
partially covered by a short loop (Gln 138–Ala 144) and are more
accessible compared to SrfTEI. The space surrounding the active site
is further enlarged by a kink in one of the helices of the helix-turn-helix
motif.

Many of the amide proton resonances of SrfTEII show line broad- 
ening or even two separate peaks, suggesting that several conforma- 
tions can occur (Supplementary Fig. 2). Recently, we described the 
existence of several conformations in the TycC3 T domain’ (the third 
module of the tyrocidine A synthetase subunit C, a substrate for 
SrfTEII) and suggested that this conformational exchange is an 
important driving force for the selective interaction with other 
domains of the NRPS system*®’®. 

NMR titration experiments! of ¹⁵N-enriched SrfTEII were performed
with CoA, myristoyl-CoA, apo- and holo-TycC3 T domains.
The resulting chemical shift perturbations (Supplementary Fig. 3) 
showed interaction with the holo form and virtually no interaction 
with the apo form in agreement with previous experiments in which 
the chemical shift differences of the altered T domain forms were 
analysed°. Notably, myristoyl-CoA also binds to SrfTEII, suggesting 
recognition of this β-hydroxy fatty acid. This might support the
previously assumed second function of SrfTEII as the starter enzyme,
responsible for loading the β-hydroxy fatty acid onto the first C
domain’. A comparison of the interaction interface of SrfTEII with 
the in cis T recognition of EntF T-TEI'*”’ shows that in particular, the
sequentially diverse helix-turn-helix motif and its connecting loops
are important for selecting the specific substrate. 

To obtain more detailed insight into the interaction between 
SrfTEII and the TycC3 T domain, we carried out further NMR titra- 
tion experiments with the acetyl-holo form of the T domain and the 
inactive point mutation Ser86Ala of SrfTEII (Supplementary Fig. 4). 
On the basis of the magnitude of the chemical shift differences, the 
acetyl-holo-T domain shows a stronger interaction with SrfTEII than 
the holo-T domain, again in agreement with previous experiments’. 
To determine the relative orientation of both domains, we measured 
nuclear Overhauser effects (NOEs) between selectively protonated 
methyl (Ile, Leu and Val) and aromatic (Phe) groups of SrfTEII and 
amide protons of the TycC3 T domain in otherwise perdeuterated 
proteins*”'. These NOEs and the chemical shift perturbations 
allowed us to calculate the structure of the complex between the 
TEII enzyme and the T domain (Fig. 3b). Previous investigations 
had revealed that the holo form of the TycC3 T domain exists in 
two different conformations that differ in the relative orientation 
of α-helix 2 and the positioning of the 4’-PP cofactor®. They also
demonstrated that interaction with the SrfTEII enzyme selects the 
more open conformation, the holo-form specific H state’. Notably, 
modification of the 4'-PP cofactor with an acetyl group also shifts the 
equilibrium and locks the T domain in the H state. Consequently, we 
used the H state of the TycC3 T domain in our structure calculations. 
The interface between both proteins involves α-helix 2, parts of
α-helix 1 and the C terminus of the T domain, in agreement with
the results reported previously’. The interface of the SrfTEII enzyme
includes the N-terminal loop, the loops between β-strand 2 and
α-helix 1 (Phe 19–Gly 23), between β-strand 4 and α-helix 3
(Gly 88–Met 90), between β-strand 6 and α-helix 7 (Asp 189,
Asp 190), the loop preceding α-helix 8 (Met 217, Phe 218 and
Gln 222) and the helix-turn-helix ‘lid’ region located between
β-strands 5 and 6 (Fig. 3). This arrangement positions the thiol
group of the 4’-PP cofactor in close proximity to the active site
residues of SrfTEII. The rest of the cofactor is surrounded by hydrophobic
amino acids (Fig. 3)°. The helix-turn-helix motif is involved
in the recognition of α-helix 2 of the T domain, with close contacts to
the peptide sequence surrounding the active site Ser 45, as suggested 
previously” and observed as well in the EntF T-TEI interaction”. 
The titration experiments further demonstrated that interaction 
with the acetyl-holo-T domain selects one of the conformations of 
the thioesterase as indicated by the disappearance of the double peaks 
in the heteronuclear single-quantum coherence (HSQC) spectrum of 
the complex (Supplementary Fig. 4). Mapping the sites of the double 
peaks onto the structure of the TEII reveals that amino acids in the 
helix-turn-helix motif, loops surrounding the active site and residues
located at the N termini of the first and last helix are affected
(Supplementary Figs 1 and 4). Unfortunately, the regions with dou- 
ble peaks are separated by stretches that display only one chemical 
Figure 1| Assembly line of the surfactin non-ribosomal peptide
synthetase. The surfactin synthetase consists of the three subunits SrfA-A, 
SrfA-B and SrfA-C that together comprise seven modules, each being 
responsible for the incorporation of one amino acid residue. The release of 
the lipoheptapeptide is catalysed by TEI attached to SrfC (module 7). The
function of SrfD, the external thioesterase TEII (lower left), is the recycling 
of misprimed T domains. The 4’-PP cofactor is depicted shortened, attached 
to the T domains. Ac, acetyl. 
Figure 2 | Mean structure of SrfTEII. a, b, Ribbon diagrams of the front
(a) and back (b) views are shown. The mean structure is derived from a final 
ensemble of the best 20 structures out of 150 structures calculated with 
CYANA and refined in explicit water using the RECOORD scripts and 
CNS1.1. The structure is colour coded from blue (N terminus) to red (C 
terminus). 


Figure 3 | Complex structure of SrfTEII and the TycC3 T domain. 

a, Interaction surfaces (red) for SrfTEII (top) and the TycC3 T domain
(bottom) are based on chemical shift perturbations observed in NMR
titration experiments of ¹⁵N-labelled Ser86Ala SrfTEII with unlabelled
acetyl-holo-T domain and vice versa. The interaction surface identified on 
the T domain is identical to results published previously”. b, Ribbon diagram 
of the refined average structure of the complex of SrfTEII and the TycC3 T 
domain calculated using the simulated annealing protocol of CNS1.1. 
shift value, making an unambiguous assignment to one particular
conformation impossible. Thus, the structure of the SrfTEII repre- 
sents an intermediate between two distinct conformations. The iden- 
tity of the regions showing this conformational exchange combined 
with the selection of one state by binding of the T domain, however, 
allowed us to build a model of the exchange process. The two differ- 
ent states probably represent an open and a closed conformation of 
the enzyme (Supplementary Fig. 7). A similar plasticity of the lid 
region has been observed in the crystal structure of the SrfTEI enzyme 
in which two molecules with different conformations are located in 
the asymmetric unit*. The main difference between both molecules is 
the position of the lid, with one conformation representing a more 
open and the other a more closed state. Because both molecules, 
however, also form a non-native dimer in the crystal structure, the 
importance of this observation was not clear. The detection of an 
equilibrium of distinct conformations in the SrfTEII enzyme, which 
is shifted towards one conformation by the interaction with a modified
holo-T domain, is the first confirmation of such a conformational 
exchange process in solution. Similar exchange processes are also 
observed in the structure of the EntF T-TEI di-domain”. 

The complex structure further shows the structural basis for the 
recognition of short acyl groups attached to the 4'-PP thiol. 
Comparison of the active sites of the TEII and TEI enzymes shows 
that, despite the wide opening of the active site of TEII, the space 
available to accommodate a group attached to the 4'-PP cofactor is 
limited (Fig. 3 and Supplementary Figs 6 and 8). The open conforma- 
tion of the SrfTEI enzyme is characterized by a pronounced active site 
cavity with a volume of 630 Å³. Modelling of a heptapeptide on the 
basis of residual electron density has shown that this cavity is large 
enough to accommodate the entire peptide and to enforce a con- 
formation that allows cyclization. In contrast to the deep, bowl- 
shaped cavity of the TEI enzyme, the active site of SrfTEII is embed- 
ded in a shallow groove that can accommodate only small acyl sub- 
stituents on the 4’-PP cofactor (Supplementary Fig. 8). This 
specificity of TEIIs for small acyl substrates was previously demonstrated 
in kinetic studies with TycF, the TEII of the tyrocidine synthetase, 
and with SrfTEII. To investigate this selectivity further, we 
performed titration experiments with ¹⁵N-labelled SrfTEII and the 
unlabelled TycC3 T domain loaded with either a single amino acid 
(Ala) or a tripeptide (Phe-Pro-Phe). Titration with the Ala-loaded T 


Surfaces are displayed only for the residues of SrfTEII (blue) and the TycC3 
T domain (red) showing intermolecular NOEs. Residues showing chemical 
shift changes are coloured in the ribbon diagrams accordingly. Some of the 
active site residues of the TEII triad (Ser 86 and His 216) and the modified T 
domain active site (Ser 45-4'-PP) showed very severe line broadening 
(Supplementary Figs 3 and 4). The position of the 4’-PP cofactor shown is 
based on the position in the free H state of the TycC3 peptidyl carrier protein 
(TycC3-PCP). 

Figure 4 | Size limitation in substrate recognition by SrfTEII. The 
combined ¹⁵N and ¹H chemical shift perturbation (Δp.p.m.) of ¹⁵N-HSQC-based 
NMR titration experiments of ¹⁵N-enriched SrfTEII with the 
unlabelled TycC3 T domain loaded either with an alanine residue non- 
hydrolysably bound to the 4'-PP cofactor (a) or with the tripeptide Phe-Pro- 
Phe (b) show a selectivity for small substrates by SrfTEII. The interaction of 
SrfTEII with the Phe-Pro-Phe-holo-TycC3 T domain is limited to residues 
also involved in holo-T domain recognition, which supports the described 
selection of small substrates driven by the limited size of the cavity. Residues 
with resonances broadened beyond detection in the 1:1 SrfTEII:Ala-holo-TycC3 
T domain complex are indicated by red bars. Chemical shift 
differences of amino acids showing chemical shift perturbations are shown 
as blue bars. The surface of the SrfTEII protein is coloured accordingly. 
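
The caption does not state how the ¹⁵N and ¹H shift changes were combined into a single value. One widely used convention, assumed here rather than taken from the paper, is a weighted Euclidean distance in which the ¹⁵N difference is scaled down before it is combined with the ¹H difference. The sketch below uses a hypothetical weighting factor of 0.14 and invented shift differences purely for illustration.

```python
import math


def combined_shift_perturbation(delta_h_ppm, delta_n_ppm, n_weight=0.14):
    """Combine 1H and 15N shift differences (p.p.m.) into one value.

    n_weight scales the 15N difference down to the 1H scale; 0.14 is an
    assumed, commonly used value, not one given in the text.
    """
    return math.hypot(delta_h_ppm, n_weight * delta_n_ppm)


# Hypothetical shift differences (free minus bound), in p.p.m.: (1H, 15N).
example_shifts = {
    "residue A": (0.08, 0.50),
    "residue B": (0.01, 0.05),
}

for name, (d_h, d_n) in example_shifts.items():
    value = combined_shift_perturbation(d_h, d_n)
    print(f"{name}: {value:.3f} p.p.m.")
```

Because the result is a distance, it is always non-negative, which is why plots of this kind show bars in one direction only.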


domain showed results very similar to those with the acetyl-holo-T 
domain with the active site and surrounding residues showing chem- 
ical shift changes or line broadening. In contrast, the titration with 
the tripeptide-loaded T domain resulted in only minor chemical shift 
differences, far from the active site (Fig. 4). The titration results also 
indicate differences in the dynamic behaviour between acetyl-holo-T 
and Ala-holo-T on the one hand, and holo-T and peptide-loaded 
holo-T on the other. Whereas titrations with the first group indicate 
the formation of a stable complex by selection of one of the confor- 
mations represented by double peaks, titrations with the second 
group show a less pronounced selection of one conformation and 
only limited chemical shift differences. 

In addition to the shape and volume of the active site, the interaction 
of the enzymes with the 4’-PP cofactor might also differ. In the 
SrfTEI structure the 4’-PP cofactor is almost completely surrounded 
by a channel formed by the TEI, whereas it seems to be sandwiched 
between SrfTEII and the T domain in the in trans complex, demonstrating 
again the wider opening of the active site. 

The structures reported here and their comparison with TEI 
enzymes show how modulation of the conserved thioesterase fold 
is used to change the function of the enzyme from one that recognizes 
the final product of the assembly line to one with a shallow but easily 
accessible active site that provides a rather unspecific but indispens- 
able repair function. 

Recently, crystal structures of type I fatty acid synthetases have 
shown the complex interaction between individual domains in these 
multi-enzymatic assemblies, confirming the central role that the T 
domains and different orientations of their cofactors have in the 
iterative substrate shuttling between active sites. 


METHODS SUMMARY 


The C-terminal His₆-tagged wild-type SrfTEII and the inactive Ser86Ala mutant 
proteins were heterologously expressed in Escherichia coli and purified by Ni- 
chelation affinity chromatography. All isotope enriched protein samples were 
produced in supplemented M9 minimal media with selectively labelled carbon 
and nitrogen sources. NMR spectra for backbone and side chain resonance 
assignment, and structure calculation, were recorded on Bruker Avance800 and 
Avance900 spectrometers. All NMR titration experiments of labelled protein 
samples with small molecules and unlabelled proteins were performed on a 
Bruker Avance700 spectrometer. Bruker XWINNMR or Topspin 1.3 was used 
for data processing and UCSF SPARKY 3.111 for resonance assignment and 
NOE integration. The structure of SrfTEII was calculated on the basis of 2,442 
NOE upper distance limits and constraints for 92 hydrogen bonds in regular 
secondary structure elements using the program CYANA 2.1 (ref. 28). The 20 
conformers with the lowest target function values were energy refined in explicit 
water using the RECOORD scripts and the CNS 1.1 protocol, to represent the 
solution structure of SrfTEII. All distance constraint violations were smaller than 
0.2 Å. The in trans complex structure was calculated using the CNS 1.1 simulated 
annealing protocol. The contact surfaces of both proteins were identified by 
NMR titration experiments of labelled protein samples with the corresponding 
unlabelled interacting protein. Constraints to describe the relative domain 
orientation were obtained from a three-dimensional (3D)-¹⁵N-nuclear Overhauser 
enhancement spectroscopy (NOESY)-transverse relaxation-optimized spectroscopy (TROSY) 
spectrum of a complex of ¹⁵N-labelled, perdeuterated holo-TycC3 T domain and 
fully perdeuterated, selectively Phe, Ile, Leu and Val protonated SrfTEII (refs 20 
and 21). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Protein expression and purification. The SrfTEII protein (residues 1-241) and 
its inactive Ser86Ala mutant were heterologously expressed in the E. coli strain 
BL21 (DE3) using a pET-expression vector system containing a C-terminal His₆-tag. 
They were purified by Ni-chelation affinity chromatography and subsequent 
gel filtration chromatography using a Pharmacia Superdex-75 column. The 
purity of all protein samples was validated by SDS-PAGE analysis. All labelled 
samples of the SrfTEII protein were produced in supplemented M9 media with 
stable isotope enriched glucose (¹³C or ¹³C/²H; Cambridge Isotopes 
Laboratories) as the only carbon source and ¹⁵N ammonium chloride 
(Cambridge Isotopes Laboratories) as the nitrogen source. For the preparation 
of fully perdeuterated samples the aqueous solvent was replaced by D₂O and 
perdeuterated glucose was used as the carbon source. The stable isotope enriched 
(²H, ¹⁵N, ¹³C) and unlabelled samples of the TycC3 T domain (1-87) were 
expressed and purified as described previously. 

NMR data acquisition and assignment. All NMR spectra for backbone and side 
chain resonance assignment and structure calculation of the SrfTEII protein 
were recorded on an Avance800 or an Avance900 spectrometer equipped with 
a 5 mm triple resonance, z-gradient cryogenic probe at 298 K. The resonance 
assignment was based on standard triple-resonance experiments, following 
standard protocols. An additional 4D ¹⁵N/¹⁵N-resolved NOESY spectrum 
of ¹⁵N-labelled and 70% perdeuterated SrfTEII was recorded to verify the 
sequential resonance assignment and to define unambiguous amide-amide 
based distance constraints. DSS (4,4-dimethyl-4-silapentane-1-sulphonate) 
was used as an internal chemical shift reference. All ¹⁵N-HSQC based NMR 
titration experiments were performed on an Avance700 spectrometer at 
296 K. The spectral width was set to 12.5 p.p.m. in the proton dimension and 
35 p.p.m. in the nitrogen dimension. A total of 2,048 points in the direct and 
1,024 points in the indirect dimension were collected for all HSQC spectra. The 
TROSY-based ¹⁵N-edited NOESY experiments to measure interface distance 
constraints in the complex of Ser86Ala SrfTEII with the acetyl-holo-TycC3 T 
domain (1:1 ratio) and wild type SrfTEII with the holo-TycC3 T domain (1:2.5 
ratio) were recorded on an Avance900 with a mixing time of 182 ms. XWINNMR 
3.1 or Topspin 1.3 (Bruker) were used for processing and SPARKY 3.111 (T. D. 
Goddard and D. G. Kneller, SPARKY 3, University of California) for resonance 
assignment and NOE peak integration. The SrfTEII protein could be assigned to 
92% completeness of all 227 non-proline backbone resonances and 91.5% of all 
resonances. 
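
The acquisition parameters above imply a digital resolution for the HSQC spectra, which can be estimated with simple arithmetic. The sketch below assumes a nominal ¹H frequency of exactly 700 MHz for the Avance700 and an approximate ¹⁵N/¹H frequency ratio of 0.1013; it also takes the quoted point counts at face value, since the text does not say whether they are real or complex points or whether zero-filling was applied.

```python
def hz_per_point(width_ppm, frequency_mhz, n_points):
    """Spectral width in Hz (p.p.m. x MHz) divided by the number of points."""
    return width_ppm * frequency_mhz / n_points


PROTON_MHZ = 700.0                   # nominal 1H frequency (assumed exact)
NITROGEN_MHZ = PROTON_MHZ * 0.1013   # approximate 15N/1H ratio (assumed)

# Spectral widths and point counts as quoted in the text.
direct = hz_per_point(12.5, PROTON_MHZ, 2048)
indirect = hz_per_point(35.0, NITROGEN_MHZ, 1024)

print(f"1H dimension:  {direct:.2f} Hz per point")
print(f"15N dimension: {indirect:.2f} Hz per point")
```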

Structure calculation and refinement. Structures of SrfTEII were calculated by 
simulated annealing in torsion angle space with the program CYANA 2.1 (refs 28 
and 34). Refinements in explicit water (TIP3P) were performed using the 
RECOORD scripts and the CNS 1.1 protocol. In all calculations 210 unambiguously 
identified backbone amide to amide contacts from a 4-dimensional 
¹⁵N/¹⁵N-resolved NOESY experiment were applied with upper distance 
bounds of 6.5 Å. The secondary structure was defined by 184 distance constraints 
for 92 backbone hydrogen bonds identified on the basis of unambiguous NOEs 
in a 3D-¹⁵N-resolved NOESY spectrum and by 278 torsion angle constraints 
derived from chemical shift values with the program TALOS. Furthermore, 
2,442 distance constraints were generated on the basis of 7,802 NOESY cross 
peaks from the same 3D-¹⁵N-resolved NOESY and three 3D-¹³C-resolved 
NOESY spectra recorded for aliphatic, methyl and aromatic side chains sepa- 
rately. An initial structure was calculated from 2,851 manually assigned NOEs 
and the torsion angle constraints with CYANA 2.1. The calculation yielded a 
structural bundle (20 out of 100 calculated structures, sorted by lowest target 
function value) with a precision (root mean square deviation, r.m.s.d.) of 3.8 Å 
and an average target function value of 1.9 Å². The CYANA script with automatic 
NOE assignment was used with the resulting 1,706 distance constraints of the 
previous calculation and an extra 94 distance constraints obtained from manu- 
ally assigned CH-NOEs selective for aromatic side chains. The interpretation of 
4,951 ambiguous NOEs resulted in an additional 642 distance constraints and in 
a structural bundle with an r.m.s.d. of 1.35 Å for all heavy atoms and an average 
target function value of 4.9 Å² for the 20 structures with lowest CYANA target 
function values out of 150 calculated conformers. The final set of 20 out of 150 
calculated structures shows no distance or van der Waals violations larger 
than 0.20 Å and no angle violations >2.6°; 75% of all dihedral angles are located 
in most favoured regions and 24.8% in additionally allowed regions of the 
Ramachandran plot. Pockets in SrfTEI (ref. 3) and SrfTEII were identified and 
their sizes computed using the CASTp algorithm with a probe radius of 2.0 Å. 
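
The counts quoted in this paragraph can be read as a running tally, and on that reading they are internally consistent. The short check below is an interpretation of how the numbers relate, not a description the text gives explicitly; in particular, the assumption that each hydrogen bond contributes two distance constraints is inferred from the ratio 184:92.

```python
# NOESY cross peaks: manually assigned plus ambiguous (values from the text).
manual_noes = 2851
ambiguous_noes = 4951
total_cross_peaks = manual_noes + ambiguous_noes

# Distance constraints accumulated over the calculation stages (from the text).
constraint_stages = {
    "carried over from the initial calculation": 1706,
    "manually assigned aromatic side-chain NOEs": 94,
    "interpretation of ambiguous NOEs": 642,
}
total_constraints = sum(constraint_stages.values())

# Assumed: two distance constraints per backbone hydrogen bond.
hydrogen_bonds = 92
hbond_constraints = 2 * hydrogen_bonds

print(f"cross peaks:          {total_cross_peaks}")    # text quotes 7,802
print(f"distance constraints: {total_constraints}")    # text quotes 2,442
print(f"H-bond constraints:   {hbond_constraints}")    # text quotes 184
```
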

NMR titrations and complex structure calculation. Chemical shift perturbations 
were measured in ¹⁵N-HSQC-based NMR spectra for titration experiments 
of ¹⁵N-labelled holo- and acetyl-holo-TycC3 T domain with 
unlabelled SrfTEII and vice versa. The chemical shift perturbations and the line 
shapes of all titration experiments were analysed using MestReC 4.9.9.6 (ref. 38). 
It has been demonstrated previously that the H state of the TycC3 T domain is 
recognized as a substrate by the SrfTEII (ref. 5). Interaction surfaces of the carrier 
protein and the thioesterase were identified by the titration experiments and the 
structures of both proteins were used to calculate the structure of the enzyma- 
tically active complex. Constraints to describe the relative domain orientation 
were obtained from a 3D-¹⁵N-resolved NOESY-TROSY spectrum of a complex of 
¹⁵N-labelled, completely perdeuterated holo-TycC3 T domain and fully perdeuterated, 
selectively Phe, Ile, Leu and Val protonated SrfTEII (refs 20 and 21). 
Nine unambiguous and seventeen additional ambiguous constraints between 
the TycC3 T domain amide protons and SrfTEII FILV-side-chain protons were 
identified. Distance constraints on the basis of these NOEs were applied to the 
structural calculation of the in trans di-domain complex using the CNS 1.1 
protocol. All structural figures were prepared using UCSF CHIMERA 1.2470 
(ref. 39). 

In vitro modification of the TycC3 T domain. The in vitro modification of the 
unlabelled apo-TycC3 T domain was carried out in 2 ml reaction mixtures of 
0.25 mM apo-TycC3 T domain, 0.5 mM acetyl-CoA (Sigma), 20 µM Sfp and 
5 mM MgCl₂, buffered in 100 mM sodium phosphate at pH 8.0 for 45 min at 
room temperature. The reaction mixture was subsequently purified by desalting 
(Econo-pac 10 DG desalting column (BioRad)) and concentrated using an 
Amicon Ultra 4 Ultracell-5k (Millipore) filter device with a molecular weight 
cutoff of 5 kDa. 
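
Setting up such a reaction amounts to a dilution calculation (C1V1 = C2V2) for each component. The sketch below uses the final concentrations and the 2 ml volume given above; the stock concentrations are hypothetical placeholders, since the text does not report them.

```python
FINAL_VOLUME_UL = 2000.0  # 2 ml reaction, from the text

# Final concentrations from the text, in mM.
final_mM = {
    "apo-TycC3 T domain": 0.25,
    "acetyl-CoA": 0.5,
    "Sfp": 0.020,
    "MgCl2": 5.0,
    "sodium phosphate pH 8.0": 100.0,
}

# Stock concentrations in mM: hypothetical values, NOT from the paper.
stock_mM = {
    "apo-TycC3 T domain": 1.0,
    "acetyl-CoA": 10.0,
    "Sfp": 0.2,
    "MgCl2": 1000.0,
    "sodium phosphate pH 8.0": 1000.0,
}


def pipetting_volumes_ul(final, stock, total_ul):
    """Return the volume of each stock (and of water) needed, in microlitres."""
    volumes = {name: final[name] * total_ul / stock[name] for name in final}
    water = total_ul - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested volume")
    volumes["water"] = water
    return volumes


volumes = pipetting_volumes_ul(final_mM, stock_mM, FINAL_VOLUME_UL)
for name, vol in volumes.items():
    print(f"{name}: {vol:.1f} µl")
```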

Synthesis of peptidyl-amino CoA substrates. The general procedure for the 
synthesis of peptidyl-amino CoA substrates was based on a synthesis described 
previously. In brief, 10 mmol of amino-CoA, 15 mmol of PyBOP and 
40 mmol of potassium carbonate were added to 10 mmol of the Boc-protected 
peptide. The solids were subsequently dissolved in a 1:1 THF/water mixture 
(500 ml total) and allowed to stir at room temperature overnight. The reaction 
mixture was directly purified by preparative high-performance liquid chromatography 
(HPLC) using a single injection on a Phenomenex C18 250 × 21.2 mm, 
10 µm, 100 Å column and eluting using a gradient of 0% to 60% acetonitrile 
containing 0.1% trifluoroacetic acid over 30 min and a flow rate of 10 ml min⁻¹ 
while monitoring at 260 nm. The identities of the Boc-protected peptidyl-amino 
CoA substrates were verified by HPLC-mass spectrometry (MS) and matrix-assisted 
laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry. 
Next, cleavage of the Boc-protecting group was carried out by dissolving 
the Boc-protected peptidyl-amino CoA in a 95:2.5:2.5 mixture of trifluoroacetic 
acid, trifluoroethanol and water, and allowing this mixture to stir at room 
temperature for 2 h. The deprotected peptidyl-amino CoA substrates were puri- 
fied by preparative HPLC using the same conditions as described above. HPLC- 
MS and MALDI-TOF MS were used to confirm the identity of the peptidyl- 
amino CoA substrates. 
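
The elution conditions can be written down as a simple gradient profile. The sketch below assumes the gradient is linear, which the text does not state, and uses the quoted 0% to 60% acetonitrile range, 30 min duration and 10 ml min⁻¹ flow rate to estimate solvent consumption per run.

```python
GRADIENT_START_PCT = 0.0    # % acetonitrile at t = 0 (from the text)
GRADIENT_END_PCT = 60.0     # % acetonitrile at the end (from the text)
GRADIENT_MIN = 30.0         # gradient duration in minutes (from the text)
FLOW_ML_PER_MIN = 10.0      # flow rate (from the text)


def percent_acetonitrile(t_min):
    """Acetonitrile percentage at time t_min, assuming a linear gradient."""
    if not 0.0 <= t_min <= GRADIENT_MIN:
        raise ValueError("time is outside the gradient window")
    span = GRADIENT_END_PCT - GRADIENT_START_PCT
    return GRADIENT_START_PCT + span * t_min / GRADIENT_MIN


# Total eluent volume, and the acetonitrile share for a linear ramp
# (the mean composition is the average of the start and end values).
total_volume_ml = FLOW_ML_PER_MIN * GRADIENT_MIN
mean_pct = (GRADIENT_START_PCT + GRADIENT_END_PCT) / 2.0
acetonitrile_ml = total_volume_ml * mean_pct / 100.0

print(f"composition at 15 min: {percent_acetonitrile(15.0):.0f}% acetonitrile")
print(f"eluent per run: {total_volume_ml:.0f} ml, "
      f"of which acetonitrile: {acetonitrile_ml:.0f} ml")
```
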


Subject to change 


Getting to grips with scientific variables. 


Joseph Lachance 


Huang, L. et al. Comments re: Lee, V. et al. Science 
322, 123 (2009). 

“All efforts to repeat the results by Lee et al. 
have been unsuccessful. While it is under- 
standable that they lack our impeccable 
experimental technique, we believe that 
they failed to properly control for...” 


Jones, M. C. Irreproducibility. Philos. Sci. 78, 
143-151 (2011). 

“An increasing number of studies show a 
lack of reproducibility. In addition, accu- 
sations of scientific fraud have skyrocketed 
in the past year. Most famously, the 
dry-ice bomb incident between 
chemists at Oxford and Chicago led 
to no less than 7 arrests ...” 


Jimenez, F. Physical laws are local. Nature 
482, 432-435 (2014). 

“Research sites on either side of the 
Wallace Line consistently obtain 
different results. These experi- 
ments conclusively demonstrate 
that physical laws are not the same 
in different locations. It has not 
escaped our attention that this phe- 
nomenon may be responsible for 
numerous discrepancies in the past 
5 years. We have no choice but 
to conclude: what happens in Vegas, 
stays in Vegas...” 


Lapointe, A. The End of science? Time 186, 
24-31 (2015). 

“This is not your father’s or mother’s 
physics. At a recent conference in Zurich, 
physicists heatedly debated the objectivity 
of science. If physical laws do indeed vary 
from location to location, the foundations 
of science are on shaky ground. Matters 
were not helped when it was revealed that 
a cultural relativist accepted the Ultimate 
Boeing 747 gambit, only to glide safely 
and slowly down to earth from a height 
of 30,000 feet. Science as defined by Karl 
Popper is dead...” 


Orbach, R. et al. Aether or dark matter? 
Astrophys. J. 814, 78-90 (2016). 

“Signals from the pulsar J1342+2822B 
seem to have been modified in transit. 
Results are consistent with the effect of a 
gravitational lens, perhaps due to the exist- 
ence of dark matter. However, an alterna- 
tive explanation is that the signal passed 
through a region of space with different 
physical laws. If so, this would mean that 
the Jimenez effect is more pervasive than 
previously thought...” 


Kabaka, F. et al. Evidence of dynamic physical 
laws. PLoS Phys. 6, 1-17 (2017). 

“Results from supercollider experiments 
exhibited a marked change after 09:43 
GMT on 12 January 2017. Something 
clearly changed. Prior to 12 January, 
readings at CERN were 9.34+0.3 MeV. 
Subsequent readings were 5.34 +0.2 MeV. 
Interestingly, readings at Fermilab were 
consistently in the 9.34 MeV range. This 
suggests that physical laws are both local 
and dynamic...” 


Singh, N. A history of cellular automata. Phys. 
Rev. Lett. 118, 56-72 (2017). 

“Furore over dynamic physical laws has led 
to increased interest in cellular automata 
and artificial life. Cellular automata are 
spatial models in which states of cells 
within a grid are updated in discrete steps. 
Simple spatial rules, such as those of Con- 
way’s Game of Life, can generate complex 
results. Here, the state of a cell depends on 
the states of neighbouring cells. Patterns 
such as ‘blocks’, ‘blinkers’ and ‘gliders’ have 
been found. Some patterns give the appear- 
ance of being alive. It is rumoured that 
the amount of computing time dedicated 
to the Game of Life is exceeded only by 
SETI@Home and World of Simcraft ...” 


Chen, X. & Li, H. Meta-rules and the search 
for universality. Preprint at <http://arxiv.org/abs/0992.4174> (2019). 

“Recent discoveries have divided the scien- 
tific community into two warring camps. 
Localists believe the new paradigm that 
physical laws are local, whereas Universal- 
ists cling to the idea that physical laws are 
universal. We suggest that local laws are 
determined by some universal meta-rule. 
This could bring the Localist-Universal- 
ist feud to an end, allowing resources to 
be diverted to empirical studies of Mur- 
phy’s law (a research programme that has 
garnered such prestigious awards as the 
IgNobel prize) ...” 


Williams, C. et al. Physical Laws are Cellular 
Automata. Proc. Natl Acad. Sci. USA 116, 78-92 (2019). 

“As shown in Figure 3, local laws are spa- 
tially and temporally correlated. Laws 
belong to a finite number of configura- 
tions and depend on laws present 
at nearby locations. This indicates 
that physical laws behave as cellular 
automata. Patterns of physical laws 
exhibit both periodicity and mobility. 
Interestingly, ‘Rule 110’ seems to 
govern the behaviour of fringe ele- 
ments...” 


Abel, C. & Wilkins, O. Evolution of 
simulated organisms. Artif. Life 26, 
231-239 (2020). 

“Artificial life can exist in the form 
of cellular automata. Using a set of 
iterated, spatial rules, we simulated 
an ecosystem on a large cluster of 
microcomputers. For the first time, 
the existence of a virtual organism 
more complex than Escherichia coli 
has been demonstrated (see sup- 
plemental pages S1-S23 for definitions of 
complexity). It remains to be seen if sen- 
tience can evolve in silico...” 


Meme, A. Life within the laws of the Universe. 
www.stochasticity.scienceblogs.com (2021). 

“If I understand things correctly, this means 
that life, or something like it, exists on a 
truly grand scale. No, the Universe is not 
a giant game of Go. Instead, the rules of 
the Universe are flickering between states 
as if they are cellular automata. The bizarre 
events of the past few years seem to fit with 
a ‘glider’ passing through our local neigh- 
bourhood. Things have returned to normal, 
but for how long? If ‘gliders’ can exist, so too 
might other more complex patterns. Does 
this mean that the Universe itself is alive? 
I can only begin to wonder what strange 
organisms exist within the fabric of the 
Universe. Perhaps we are not alone!...” 

Joseph Lachance is a PhD student at 
Stony Brook University, where he studies 
theoretical population genetics and 
Drosophila genetics. 